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CERVICAL  RADICULOPATHY  AND  CARPAL  TUNNEL  SYNDROME:  A 

prospective  determination  of  the  reliability,  diagnostic 
PR"cv,  AND  predictive  validity'  of  commonly  used 

CLINICAL  EXAMLNL4TI0N  MEASURES  AND  PATIENT  SELF-REPOR 

INSTRUMENTS 


Robert  S.  Wainner,  PhD 
University  of  Pittsburgh,  2000 

Background:  Patients  with  n.dicuiopa.hy  means 

and  occupational  costs  annually.  There  is  a  nee  to  es  .  ^  K  detcmunc  thc  reliability  diagnostic 
for  the  diagnosis  of  both  conditions.  The  purpose  0  1  s  -  diagnosis  of  cervical 

radiculopathy  and  c*^Pd  ^vT^^^ivroUrotandardized  electrophysiological  examination  of  their 

40  males  (mean  age- 45.0  ■  presenting  symptoms  and  electrophysiological 

affected  iimb.  Patients  received  a  ^  J£L*.  exam  and  sussed 

examination.  Two  physical  therapist  raters,  me  At  six-weeks,  aU  subjects  were 

condition,  performed  a  standardized  clinical  exarmm  to  „  ,  Rcs„lt,  Thirteen  subjects 

maned  the  same  sed-mpori  forms  initially  completed  at.  ««* ~  ^  ^  ^  ^ 

(16%)  and  31  (38%)  subjects  were  diagnosed  wit  dijerent  clinical  examination  variables 

respectively.  The  following  levels  of  reliability  f^rito  Good  (Kappa=  ,40-  75),  and 

Uvvere  poor  (Kappa  <  4U.  ice  >  niimher  7  for  cervical  radiculopathy  had  a 

syndrome  had  aroepmbie  Idtelihood  ratios^  ^  ^  pre*a„rs  had 

definitively  acceptable  Likelihood  ratio  (  -  of  worsened  status 

acceptable  Likelihood  ratios.  Based  on  patient  carpaUunnel  subjects  had  acceptable 

and  ,8  predictots  of  improved  sums  fo,  non-surgicall,  «-  rp  was  identified 

LAelihood  ratios.  Pot  each  drastic  and  prediettve;  —a  ^  ^  ^  ^  ^ 

that  would  produce  post-test  probability  chan8e!"’^  tab|e  reliability.  None  of  the  definitive 

Conclusion:  The  majority  of  test  items  -  (hat  would  result  „  post-tes,  probabilities 

LR+  values  for  individual  tests  or  tes,  Custers  <■-■£  p,e«ve  vaMty  of 

i  r  than  33%  More  precise  estimates  are  required 
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Thank  you,  Jesus  Christ,  for  life,  salvation,  and  the  ability  You  so  graciously  granted  me 
to  complete  my  Doctoral  program. 

Thank  you,  family  You  have  been  supportive  and  patient  beyond  compare.  Diane,  you 
are  a  treasure:  I  only  hope  I  can  inspire  and  encourage  you  as  much  as  you  have  inspire 
and  encouraged  me.  Rachael,  Rebecca,  Andrew,  Bethany,  and  Joshua:  thank  you  or  e 
light- heartedness,  perspective,  and  frequent  “reality  checks"  you  brought  and  continue  to 
bring  into  my  life.  Jordan  Lee,  thank  you  teaching  me  not  to  take  myself  too  senously 
for  teaching  me  how  to  be  a  better  “juggler”,  and  for  what  you  have  taught  all  of  us  a 
the  true  meaning  of  life  in  such  a  short  period  of  time;  your  family  welcomes  yo 
and  Dad,  Coach  and  Mrs.  Bonner,  I  could  not  have  come  this  far  without  you  an  am 

indebted  to  you  all. 

Thank  you.  Committee  Members,  for  your  guidance,  sacrifice,  and  support.  Julie,  a 
special  thanks  for  providing  me  with  a  role  model  of  what  a  Doctoral  student  s  ou 
well  as  your  close  mentorship  and  friendship. 

Thank  you,  to  the  many  other  people  who  have  made  my  acadenac  accomplishments  here 
at  the  University  of  Pittsburgh  possible.  Please  grant  me  your  forgiveness  as  space  does 
not  allow  me  to  properly  acknowledge  your  invaluable  contributions. 


TABLE  OF  CONTENTS 


1.0  Introduction 


2.0  Statement  Of  The  Problem 

2.1  The  Impact  of  Cervical  Radiculopathy  and  Carpal  Tunnel  Syndrome  on 

Society  and  on  Diagnostic  Decision  Making  . 

2.2  Diagnostic  Tests  Considerations . 

2.2.1  Levels  of  Efficacy . 

2.2.2  Research  Methodology . 

2.2.3  Metrics  and  Interpretation  of  Test  Properties . 

2.3  Reference  Criteria  or  "Gold  Standards" . 

2.3.1  Cervical  Radiculopathy . 

2.3.2  Carpal  Tunnel  Syndrome . 

2.3.3  Patient  Outcome . 

2.4  Critical  Appraisal  of  Clinical  Examinatton  Measures  and  Self-Report 
Instruments  for  Cervical  Radiculopathy  and  Carpal  Tunnel  Syndrome . 

2.4.1  Common  Clinical  Examination  Measures . 

2.4.2  Common  Patient  Self-Report  Measures . 


3.0  Research  Hypotheses 


4.0  Research  Design  and  Methods 

4. 1  Inclusion  Criteria  . 

4.2  Methods  . 

4.2.1  Procedure . 

4.2.2  Patient  Demographic  Data  and  Past  Medical  History 

4.2.3:  Standardized  Electrophysiologic  Examination . 


V 


4.2.4  Diagnostic  Tests ... 


....72 


4.2.4. 1  Clinical  Examination  Procedures  .. 

4. 2. 4. 2  Patient  Self-Report  Measures . 

4.2.5  Patient  Outcome  Gold  Standard . 

4.3  Data  Analysis . . . 

4.3.1  Hypothesis  #1  -  Reliability . 

4.3.2  Hypothesis  #2  -  Diagnostic  Accuracy . 

4.3.3  Hypothesis  #3  -  Test  Predictive  Validity . 


72 

.75 

.75 

.75 

.75 

.80 

.83 


5.0  Sample  Size  Estimation 


. 85 


. 86 

6.0  Results  . 

8  6 

6  1  Study  Sample  and  Diagnostic  classification . 

97 

6  2  Hypothesis  #1  -  Reliability . 

,  _  .  A  . 106 

6.3  Hypothesis  #2  -  Diagnostic  Accuracy . 

6.3.1  Diagnostic  Characteristics  of  Single  Examination  Items 

for  Cervical  Radiculopathy  and  Carpal  Tunnel  Syndrome . 1 

6  3  2  Diagnostic  Characteristics  of  Single  Examination  Items  for  Carpal  Tunnel 

. 121 

Syndrome  Subclassified  Groups . 

124 

6.4  Hypothesis  #3  -  Predictive  Validity . 

6  4  1  Predictive  Validity  of  Single  Examination  Items  for  Cervical 

. 124 

Radiculopathy  Subjects . 

6  4  2  Predictive  Validity  of  Single  Examination  Items  for  Carpal  Tunnel 

. 125 

Syndrome  Subjects . . . 

127 

6.4. 2.1  Surgical  Intervention  Gold  Standard . 

6.4  2.2  Patient  Perception  of  Change  Gold  Standard . 136 

6. 4. 2.2.1  Improved  Carpal  Tunnel  Syndrome  Subjects . 136 

6. 4. 2.2. 2  Worsened  Carpal  Tunnel  Syndrome  Subjects . 141 

154 

6.5  Hypothesis  #4  -  Test  Item  Clusters . 

6.5,1  Diagnostic  Characteristics  of  the  Cervical  Radiculopathy 


. 125 


Test  Item  Cluster . 


. 155 


vi 


6,,2  Carpal  T-  S^ome  Tes,  Kern 

6.5.2. 1  Diagnostic  Characteristics 

diagnosis  Test  Item  Cluster . 

6. 5.2.2  Diagnostic  Cha,—  of»e  Carpa,  Tonne,  Syndrome  ^ 
Surgery  Test  Itern  Cluster . 

,5  2,  Diagnostic  Chatactens,,cs  offteCarpa,  Tnnne,  Syndrome  ^ 

Improved  Test  Item  Cluster . 

6. 5.2.4  Diagnostic  Charactenstics  of  to  Ca^l  Tnnne,  Syndrom . ^ 

Worsened  Test  Item  Cluster . 


. 161 

7.0  Summary  and  Conclusions  164 

7.1  Hypothesis  #1  -  Reliability .  . l67 

7.2  Hypothesis  #2  -  Diagnostic  Accuracy .  . 16g 

7.2.1  Cervical  Radiculopathy . . .  j  59 

7.2.2  Carpal  Tunnel  Syndrome .  ^7q 


7  2.2.1 .  Subclassification  Group  Results 

7.2.12  Collective  CTS  Group  Results . 

7.2.3  Summary  of  Diagnostic  Accuracy  Findings . 


7.3  Hypothesis #3 -Predictive  Validity .  . 175 

7.3.1  Surgical  Intervention  Gold  Standard .  n? 

7.3.2  Change  Gold  Standard .  j7g 

7  3  2.1  Worsened  Patients .  ^ 

13.2.2  “Improved”  patients .  ^ 

7.3.3  Predictive  Validity  Summary .  ^ 

7  4  Hypothesis  #4 -Test  Item  Cluster .  . i82 

7.4.1  Variable  Reduction  Procedures .  Jg2 

7.4.2  Diagnostic  Accuracy  TIC  s .  jg4 

7.4.3  Predictive  Validity  TIC  . .  ^5 

7.4.4  TIC  Summary .  . 186 

..189 


7.6  Summary 

7.7  Conclusion 


. 190 

Appendix  A  -  Study  Flow  Chart,  Patient  information,  and 

Appendix  B  -  Demography  and  Self-Report  Forms . 

Appendix  C  -  Clinical  Evaluation  Forms .  ^ 

Bibliography 


List  Of  Tables 


Table  1. 


Table  2. 


Table  3. 


Table  4.  F 
I 

Table  5 
Table  6  1 

Table  7 
Table  8 
Table  9 

Table  10 

Table  11 


Table  12 


Reported  Reliability  and  VaUdUy 

Extremity  Neurologic  Examination  in  . 36 

Reported  Reliability  and  Validity  Coefficient  for  the  Motor  and  Sensory  ^ 
Examination  in  Patients  With  Carpal  Tunnel 

Reported  Reliability  and  Validity  Coefficients  for  Cervical  R . P . ^ 

BLepoitedReU^ility  and  Validity  Coefficients  for  Carpal  Tunnel  Syndrome  ^ 

^iS^SStudiesandAssociated^ Normal 
,  Conditions  Suspected  by  Providers  . ^  . 88 

- » 

i  Descriptive  statistics  o  J  Conduction  Study  Test 

9  Descriptive  Statistics  of  Subjects  Median  nerve  Conduction  S 

ResultS  uv  pindinas  in  Standardized  Examination 

10  Frequency  of  needle  Electromyography  Find  . 93 

„  Frequency  of  Needle  Electromyography  Findings  . „ 

Addition  to  the  Standard  study  ResuUs  by  Carpal 

.  12  Descriptive  Statistics  of  Median  nerve  . 95 

—  S""::— Duration  by' 


Table  13 


tunnel  Syndrome  u„c_  Duration  by 

Descriptive  Statistics  of  Subjects  ^ 


n  1-  nri  •^nation 


Table  M  Frequency  of  Spontaneous  Activity  in  the  Abductor  Polhcus  . % 

Subjects  with  Carpal  Tunnel  Syndrome .  .  97 

Table  15  Qualifications  of  Physical  Therapist  Raters 

Table  16  Reliability  of  Clinical  Examination  Variables  with  95%  C  onfidence  In  ^  ^  ^ 

Table  17  C^al Radiculopathy: 

Likelihood  Ratios  (LR-),  and  Posttive  Ltkel  . 107  -  108 

Confidence  Intervals  (95CI) . _  '  N  tive 

Table  IS  CarpalTumtelSyn^—  ^^os(lr+) With 95% 

Likelihood  Ratios  (LR-),  and  Vosm  . 109-  HO 

Confidence  Intervals  (95CI) .  .110 

T- » 

Table  20  Descriptive  Staust.es  of  Self-Report  . nl 

SeVetity . ; . ,  t  nriel  Syndrome.  Sensitivity  (Sn),  Specificity  (Sp), 

Table  21  Mild/Moderate  Carpal  y  ukeUhood  Ratios  (LR+) 

Negative  Likelihood  Ratios  (LR-),  122  _  123 

95.A  Confidence  —  (' ^  ’  Sensi.ivity  ^Specificity 

Table  22  Pronounced/Severe  Carp  positive  Likelihood  Ratios  (LR+) 

(Sp),  Negative  Likelihood  Ratios  (LR-),  . 123-124 

With  95%  Confidence  Intervals  (95C1) . . 125 

■  Statistics  of  Neck  Disability  Scores  at  o  o 
Table  23  Descriptive  Stat  s,  cs  res  for  Improved  and  Control 

Table  24  Descriptive  Stat.st.cs  of  Self-Report  . ,26 

SubieCtS . r; . f  Se,f  Report  Scores  for  Worsened  and  Control  Group 

Table  25  Descriptive  Statistics  of  Self-Report  . 

SubieC,S . -  p  edictors  for  Carpal  Tunnel  Syndrome  Subjects. 

Table  26  Surgical  Intonrention^^rcto  ^  tjve  Likelihood  Ratios  (LR-),  and 
Sensitivity  (Sn),  Spec.fi  ty  P  ^ 

Positive  Likelihood  Ratios  (L  )  . 132-133 

Intervals  (95C1) . 


i  r\ 


Table  27 


Table  28 


Table  29 


Table  30 


Table  32 


,7  Predictors  for  Improvement  tn  Ratios  (LR-), 

Subjects:  Sensitivity  (S"X  Spec .  «  3  ^  ^  ^  (95C1) 

and  Positive  Likelihood  Ratios  (L  )  . 141-142 

and  Positive  Likelihood  Ratios  (LR  )  . 148-149 

.  .  /t  n_^-  >2.0  or  LR-  — -50) 

.  29  Acceptable  Likelihood  Ratios  of  Surgical  Pre  ic  ors  . ^ 

e  30  Acceptable  Likelihood  Ratios  of  Improved  Predict  . ^ 

(LR+  ^  2.0  or  LR-  ^.50) . 

Table  31  Acceptable  Likelihood  Ratios  of  Worsened  Predictors . ^ 

(LR+  >  2.0  or  LR-  S.50)  . ;  ^lopathy:  Quest, on  9. 

to  32  Test  Item  Cluster  for  the  Diagnosis  Distraction:  Sensitivity 

Valsalva,  Biceps  brachii  Muscle  Stretch  reflex,  and  D-  ^  ^ 

(S„)  Specificity  (Sp),  Negatrve  Likelihood  a  10  .  156 

^  ’  P  Jn  •  rTR+I  With  95%  Confidence  Intervals  (95CI) . 

Likelihood  Ratios  (L  Tunnel  Syndrome:  Question  10, 

,ble  33  Test  Item  Cluster  for  the  D'a8"°®  °  ,  ^  ponicus  Muscle 

Question  3,  Symptom  Seventy  c  ’  Negative  Likelihood 

rsrn-s. 

and  Positive  Likelihood  Ratios  (  )  . 15g 

T.W  »  — 


Table  33 


.  .157 


Table  34 


•  /i  n  \  „n(4  Positive  Likelihood  Ratios  (LR+) 
(Sp),  Negative  Likelihood  Ratios  (LR-),  . ]60 

With  95%  Confidence  Intervals  (95CI) .  .  , 

T  bl  36  Test  Item  Cluster  for  Predictors  of  Worsened  Non-surgica  su  jec  s 

Table  36  lest  item  .... .  %  Fear  Avoidance  Behavior 

Carpal  Tunnel  Syndrome:  Quest, on  2(m  >,  Spectfici.y 

Questionnaire  (>54%),  and  Wrist  Rano  ^  Ratios  (LR+) 

(Sp),  Negative  Likelihood  Ratios  (LR  ),  161 

With  95%  Confidence  Intervals  (95CI) . 


vn 


List  of  Figures 


,  .  variables  Will  Demonstrate  a  Fair  to  Good 

Figure  21  Hypothesis:  Clinical  Examination  Variables 

(Karra1'  .40  -  .75)  or  Good  (Intraclass  . 105 

Correlation  Coefficient-  .75  -  .90)  .  Demonstrate  a  Poor 

F,gure  22  Hypothesis:  0-1 Ration  Coefficient  .75, 

(Kappa<  .40)  or  Poor  to  Moderate  (ini  . !0; 

Figure  2,  Receiver  Operating  Characteristic  Curve  for  Cervical  F . 

Degrees .  r^rviral  Extension:  Cut-off  55 

. m 

Degrees .  r  -cal  Left  Rotation:  Cut-offs 

Figure  25  Receiver  Opera,  ing  Characteristic  Curve  ffirCerv,  . m 

<48  and  <57  Degrees .  Analog  “Worst”  Pain 

Figure  26  Receiver  Operating  Characteristic  Curve  for  Visual  ^Ita  . 113 

rating:  Cut-off>7.5  centimeters...... .  severity  Scale:  Cut-off 

Figure  27  Receiver  Operating  Characteristic  Curv  . 113 

>5  -  . . r,>rmvcf„r  Functional  Status  Scale:  Cut-off 

Figure  28  Receiver  Operating  Characteristic  Cu  . 114 

LR+  >2-0  or  LR-  <-50) . -  n  Variables  Will  Not 

, ,  Hypothesis'  Cervical  Radiculopathy  Clinical  u  5 

Figure  31  yp  tnh1e  Level  of  Diagnostic  Accuracy . 

Demonstrate  an  Accept  Examination  Variables  Will 

LR+  >2  0  or  LR-  <  50) .  lamination  Variables  Will  Not 

13  Hypothesis:  Cervical  Radiculopathy  Climca  .  . n6 

Figure  33  yp  T  1 0f  Diagnostic  Accuracy . 

Demonstrate  an  Accepts  Specificity  Findings  of  Clinical 

Figure  34  Definitively  Acceptable  Sens, trvity  and  Specificity . „* 

Examination  V  ariables . 


Figure  35  Definitively  Unacceptable  Sensitivity  and  Specificity  Findings  of  Clinical  ^  ^ 

Examination  V  ariables .  |20 

Figure  36  Summary  of  Acceptable  Ltkelihood  Rat, os  .  128 

Figure  37  Receiver  Operating  Character  Curve  or  Cut-off 

Figure  38  Receiver  Operating  Characteristic  Curve  for  Func, tonal  . ^ 

Figure  39  Receiver  Operating  Charactensttc  Curve  for 

Questionnaire  B:  Cut-off>31  points . ^  Sen$oty 

Figure  40  Receiver  Operating  Charactensttc  Curve  or  n  . u9 

Latency:  Cut-off  >3.0  Milliseconds^  ^  Median  Motor  Latency: 

Figure  41  Receiver  Operating  Charactens  tc  . 13q 

C  •  **  r,,rvP  for  Involved  Median  Motor 

Figure  42  Receiver  Operating  Characteristic  Cuive  for  ,30 

Amplutude:  Cut-off  >4800 M— . ^  Cu,of& 

Figure  43  Receiver  Operating  Charactensttc  urve  )31 

Figure  47  Receiver  Operating  Charactenstic  . l37 

W1. . 

off>3.4  centimeters . rr^Xpear  Avoidance  Behavior  Scale 

Figure  50  Receiver  Operating  Charactenstic  . ,39 

A:  Cut-off  >54% . 


.  .  „  r  Involved  Median  Palmar  Sensory 
Figure  51  Receiver  Operating  Characterise  Curve  . ,39 

Amplitude-  Cut-off  >43  Microvolts .  ^ _ 

Figure  52  Receiver  Operating  Characteristic  Cuwe  for  Involved  Median  M 

Amplitude:  Cut-off  >81 10  Microvolts . . ■  . 

„  Hvpothesis  Carpal  Tunnel  Syndrome  Clinical  Examination  Vanab.es  W 
Demonstrate  an  Acceptable  Level  of  Predictive  Validity  for  Improvemen 

or  Sp  >  70  or  LR+ >2.0  or  LR- <  50) . 

Figure  54  Hypothesis:  Carpal  Tunnel  Syndrome  Clinical " 

Not  Demonstrate  an  Acceptable  Level  of  Predictive  Validity  ^ 

Figure  56  Receiver  Operating  Charactenstic  Cuive  for  Sy  p  . H6 

Figure  57  ^l^peraung  dmmcteristic  Curve  for  Functional  Status  Scale:  Cut-off^ 

Figure  58  ^cehmr  Operating  Characteristic  Curve  for  Visual  Analog  Scale  (worse):  ^ 

Demonstrate  an  Acceptable  Level  of  Predicti  ^ 

Sp  >  70  or  LR+ >2.0  or  LR- <  50) .  .  , ,  win 

Figure  60  Hypothesis:  Cmpal  Tunnel  Syndrome  Clinic,  Examination  Variables 

Not  Demonstrate  an  Acceptable  Lev,  ofPredictive  Validity . ^ 

for  Worsening . 


1 


l.o  INTRODUCTION 

This  multi-center,  prospective,  diagnostic-test  study  proposes  to  evall'a‘ 
commonly  used  ciinicai  examination  and  patient  self-report  measures  W  ^ 

predict  outcome  in  patients  with  suspected  cervical  (  same 

cnrpa,  tunne,  syndrome  <CTS).  Thts  study  w,H  “  "  _  at  6wks 
clinical  examination  and  patient  self-report  measures  p 

undergo  a  standardized  electrophysiological  exammatron  (EMG/NC 

upper  quarter.  Foliowing  the  EMG/NCS  — ion,  pauents  w,  a  so  ^  ^ 

battery  of  ciintcal  exanunation  measures  and  patten,  self-report  mea 

be  repeated  by  a  second  examdner.  Patient  status  will  be  determmed  a,  6wks  by  a 

questionnaire  that  inciudes  a  giobai-rating  of  change 

have  received  surgical  intervention.  The  spectfic  atms  of  thts 

1  Inter-rater  rel.ability:  clinical  examination  measures  commonly  used  to .evaluate 

ICC=  75  -  90)  to  excellent  (K-  >.'->>  } 

same  patient  is  evaluated  by  two  different  physical  therapists 

2.  Test  Diagnostic  Accuracy:  individual  items  from  the  climca,  examination  measures 
if  nnrt  instruments  will  demonstrate  acceptable  diagnostic  accurac 

CTS)  when  compared  to  a  neural  impairment  reference  criteri 

3  Test  Predictive  Validity:  individual  items  from  the  clinical  examinatron  measures, 

^  u  x:\ac /Mrs  findings  will  demonstrate  acceptab 

parient  self-repor,  instruments,  and  the  EMG/NCS  «n  ,  Ss  ^  ^  ^  ^ 

diagnostic  accuracy  values  (Sn  or  Sp  >  .70  or  LR+  >  2.0  - 

condition  (CR  or  CTS)  when  compared  to  a  patient  outcome  "  ^  ^ 

4.  -t  Item  Custers  (T,C):  an  optrmum  and  Pa—u s  and 

clinical  examination  measures  and  patient  self  rep 
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demonstrate  acceptable  diagnostic  accuracy  and  predictive  validity  values  (Sn  or  Sp  >  .70 
or  LR+  >  2.0  or  LR-  <  .50)  for  their  respective  condition  (CR  or  CTS)  when  compared  to 
a  neural  impairment  reference  criterion  and  when  compared  to  a  patient  outcome 
reference  criterion. 

2.0  STATEMENT  OF  THE  PROBLEM 

This  section  will  cover  four  topics:  1)  the  impact  of  CR  and  CTS  on  society,  2) 
reference  criteria  for  CR,  CTS,  and  patient  outcome;  3)  diagnostic  tests  considerations; 
and  4)  critical  appraisal  of  existing  of  clinical  diagnostic  test  technologies  for  CR  and 

CTS. 

2.1  The  Impact  of  Cervical  Radiculopathy  and  Carpal  Tunnel  Syndrome  on  Society 
and  on  Diagnostic  Decision  Making 

Patients  with  neck  pain  and  CTS  are  frequently  encountered  in  primary  care,1  physical 
therapy,1’2  and  a  variety  of  medical  specialty  practices  that  include  orthopedics,  physiatry, 
neurology,  and  neurosurgery.  The  prevalence  of  neck  pain  has  been  reported  to  range 
between  16-18%  for  the  middle  aged  population  and  approximately  10/o  of  the 
population  will  develop  neck  pain,  with  or  without  referral  of  pain  into  the  upper 
extremities,  during  any  given  month.3  Estimates  of  the  number  of  people  who  will  have  at 
least  one  episode  of  neck  pain  in  the  course  of  their  lifetimes  range  from  j3  to  50%. 
Although  the  exact  number  of  patients  who  develop  chronic  neck  pain  is  unknown,  there 
is  some  evidence  that  it  may  be  substantial.  Thirty-two  percent  of  patients  in  a  large, 
prospective  study  were  noted  to  have  moderate  or  severe  neck  symptoms  at  a  minimum 
10  year  follow-up  period.5  Cervical  radiculopathy,  which  most  often  occurs  as  a  result  of 
irritation  and  compression  from  a  herniated  cervical  disc  or  osteophyte,6  is  but  one  of 
many  possible  disorders  that  can  give  rise  to  neck  pain  and  disability.  However,  data 
related  to  socioeconomic  impact  of  neck  pain  as  a  result  of  CR  could  not  be  located. 
Because  CR  is  thought  by  some  to  be  one  manifestation  of  neck  pain  resulting  from  a 
degenerative  continuum,7’8  the  pain  and  disability  specifically  attributable  to  CR  may  be 

considerable. 
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CTS  is  the  most  common  nerve  compression  disorder  of  the  upper  extremity  with 
reported  prevalence  rates  ranging  from  .  1  to  2%  in  the  US  population9  "and  affects  as 
many  as  2%9  to  15%  of  workers  in  high  risk  industries.12  In  addition  to  the  frequency  of 
occurrence,  CTS  treatment  complications  and  the  percentage  of  patients  with  recurrent 
symptoms  are  sobering.  Approximately  200,000  patients  undergo  surgical  release  of  the 
volar  carpal  ligament  annually.13  According  to  Mackinnion’s  review,  7%-20%  of  these 
surgically  treated  patients  fail  to  obtain  relief14  and  the  percentage  of  patients  who 
experience  a  recurrence  of  symptoms  after  steriod  injections  into  the  carpal  canal  ranges 
from  8%-94%.15 

Cervical  radiculopathy  and  CTS  can  produce  similar  signs  and  symptoms  that  make 
distinguishing  between  the  two  conditions  difficult. 9pp'10’11  These  signs  and  symptoms 
may  include  pain,  sensory  disturbances,  and  weakness  of  the  upper  extremity.16’17  In 
addition,  there  is  evidence  that  a  small  percentage  of  patients  with  these  symptoms  are 
affected  by  both  conditions  concomitantly.18'20  Due  to  the  similarity  of  presentation  in 
patients  with  cervical  radiculopathy  and  CTS,  many  of  the  same  examination  measures 
are  often  used  to  evaluate  patients  suspected  to  have  either  condition.  This  is  done  in  an 
attempt  to  differentially  diagnose  or  discriminate  between  the  two  and  thereby  "rule-out" 
one  condition  or  the  other.16’21 22  However,  unless  the  diagnostic  properties  of  a  given 
test  or  measure  are  known,  differential  diagnosis  and  informed  decision  making  cannot 
occur  in  a  quantifiable  and  interpretable  manner. 23,24pp120'125  Unfortunately,  the 
diagnostic  properties  of  tests  and  measures  used  for  the  clinical  examination  of  patients 
with  suspected  CR  or  CTS  are  largely  unknown  or  not  well  established.25 

2.2  Diagnostic  Tests  Considerations 

Advances  in  technology  and  the  availability  of  sophisticated  laboratory  tests  have 
increased  our  diagnostic  power  for  certain  disorders.  This  selective  increase  in  diagnostic 
power  and  reliance  on  quantitative  diagnostic  tests  have  led  some  clinicians  to  view  data 
obtained  by  these  procedures  as  “hard”,  or  objective,  and  data  obtained  from  the  clinical 
examination  as  “soft”,  or  subjective. 
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This  viewpoint  has  led  many  clinicians  to  rely  on  clinical  laboratory  tests  for  establishing 
a  diagnosis.  However,  data  should  be  judged  by  their  power  and  not  by  their 

24pp.  19-21 

appearance.  ^ 

It  is  clear  that  both  CR  and  CTS  can  result  in  a  substantial  amount  of  suffering  and 
disability.  In  addition,  both  conditions  result  in  significant  medical  and  occupational 
costs  annually.  9,26pp',tM1  There  is  a  definite  need  to  establish  cost-effective,  reliable,  and 
accurate  means  for  the  diagnosis  of  both  conditions.  Aside  from  accessibility  and 
ecomomic  considerations,  these  tests  would  be  even  more  valuable  if  they  were  useful  for 
predicting  patient  outcome.  The  effort  required  to  develop  and  identify  such  tests  is 
formidable:  appropriate  research  methodology  must  be  employed;  an  adequate  gold 
standard  to  determine  presence  of  condition  and  patient  outcome  must  be  identified;  and 
diagnostic  test  properties  must  be  reported  using  metrics  that  allow  for  quantification  of 
test  results  and  their  probabilistic  interpretation.  . 

The  clinical  examination,  which  consists  of  history,  physical  examination,  and  manual 
test  procedures,  is  once  again  increasingly  relied  upon  in  this  era  of  medical  cost¬ 
cutting.27  There  are  four  purposes  or  activities  for  which  the  clinical  examination,  in 
particular  the  history,  has  been  shown  to  be  a  an  extremely  powerful  tool. 2800  These 
four  purposes  are:  making  a  diagnosis;  ruling  out  diagnostic  hypotheses;  identifying 
disorders  in  early  stages;  and  establishing  a  prognosis.31  Indeed,  with  the  exception  of 
patients  suffering  from  endocrine  and  alimentary  disorders,  information  obtained  from 
the  history  alone  has  been  shown  by  several  studies  to  be  sufficient  for  establishing  a 
diagnosis  63-  88%  of  the  time  in  patients  seen  at  outpatient  medical  clinics.  ‘  The 
physical  examination  of  the  patients  in  these  studies  provided  enough  information  to 
establish  the  diagnosis  in  most  of  the  remaining  cases  and  routine  and  laboratory  tests 
contributed  to  the  diagnosis  only  in  3-14%.29  The  ability  of  the  clinical  examination  to 
predict  how  patients  would  be  managed  produced  similar  results.28  29  Another  example 
of  the  diagnostic  power  of  the  history  is  a  battery  of  four  specific  questions  called  the 
CAGE  which  are  related  to  drinking  behaviors.  This  particular  battery  of  questions  is 
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more  sensitive  and  specific  than  any  laboratory  or  physical  examination  finding  for  the 
diagnosis  of  alcoholism.32 

Despite  the  demonstrated  value  of  the  clinical  examination,  investigations  of  the 
precision  and  accuracy  of  the  clinical  examination  have  lagged  behind  similar  studies  of 
laboratory  tests.31  Sackett  gives  five  possible  reasons  for  this:  1.  Such  investigations  are 
challenging  to  design  and  arduous  to  execute;  2.  Clinical  diagnoses  seldom  reside  in  a 
single  finding  but  rather  are  usually  derived  from  a  pattern  or  cluster  of  findings;  3 .  A 
lack  of  interest  by  clinical  investigators  in  true  clinical  research;  4.  Pecuniary  interests  in 
high  technology  research;  and  5.  Belief  by  many  physicians  that  the  “art”  of  diagnosis  is 
incapable  of  being  elucidated  and  defined  by  scientific  investigation.31  Recently  there 
has  been  a  renewed  emphasis  on  the  clinical  examination.  The  Journal  of  the  American 
Medical  Association  now  publishes  an  ongoing  series  of  articles  entitled  “The  Rational 

27 

Clinical  Examination  Series”  that  is  devoted  to  research  of  the  clinical  examination. 
International  groups  have  also  been  established  whose  goal  is  fostering  research  efforts  of 
clinical  examination  procedures  by  providing  information  and  a  collaborative  forum  for 
clinical  investigators  33 

Very  little  high-quality  research  has  been  reported  regarding  the  diagnostic  properties  of 
specific  clinical  examination  procedures  for  patients  with  disorders  of  the  neuromuscular 
skeletal  system.  Despite  the  numerous  text  books  devoted  to  the  description  and 
application  of  diagnostic  tests  for  neuromusculoskeletal  lesions,34’35’35  descriptions  of  the 
diagnostic  properties  of  the  tests  are  almost  uniformly  omitted.36  However,  the 
lamentations  over  the  current  knowledge  fund  and  calls  for  research  ring  hollow  when 
there  is  no  plan.27  This  study  will  assess  the  reliability,  diagnostic  accuracy,  and 
predictive  validity  of  several  common  clinical  examination  measures  and  patient  self- 
report  instruments  used  to  evaluate  patients  with  suspected  CR  and  suspected  CTS 
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2.2.1  Levels  of  Efficacy 

The  primary  purpose  of  diagnostic  tests  is  to  provide  clinical  information  which  can 
discriminate  among  disease  states  and  thereby  improve  patient  management.  However, 
other  purposes  of  diagnostic  tests  include  screening  asymptomatic  individuals  for  disease, 
monitoring  the  course  of  a  disease,  and  establishing  a  prognosis.37'38  Fineberg  has 
proposed  a  hierarchical  approach  to  the  assessment  of  diagnostic  tests39  that  has  been 
expounded  upon  by  Shwartz37  and  Deyo  et.  al.40  This  hierarchical  approach  consists  of 
evaluating  diagnostic  tests  at  different  levels  of  efficacy.  These  levels  of  efficacy  are 
categorized  as  technical,  diagnostic,  therapeutic,  and  outcome  and  are  described  below. 
Technical:  Refers  to  the  ability  of  the  test  procedure  to  demonstrate  adequate  safety,  be 
accessible  to  patients,  and  have  reproducible  results.40  Inter-rater  reliability  is  one 
measure  of  a  test’s  technical  efficacy  and  is  a  pre-requisite  for  establishing  test  validity.41 
Diagnostic-  Diagnostic  tests  are  utilized  to  determine  the  presence  of  a  target  disorder  in 
either  asymptomatic  or  patient  target  populations.  Diagnostic  accuracy  is  usually 

TO 

assessed  by  comparing  a  test’s  results  with  those  of  an  external  reference  standard.  The 
external  reference  or  “gold  standard”  used  for  comparison  is  the  most  accurate  and 
appropriate  method  of  determining  the  presence  or  absence  of  a  target  disorder  and  is 
usually  costly  and/or  involves  a  moderate  to  high  degree  of  risk.38,40  Therefore,  clinicians 
utilize  diagnostic  tests  that  are  less  costly  and  involve  lower  risk  but  are  still  effective. 
Therapeutic  &  Outcome-  These  are  the  highest  levels  of  efficacy  for  a  diagnostic  test  and 
arguably  the  most  important.  A  highly  accurate  diagnostic  test  is  no  guarantee  that  the 
test  is  useful.  The  true  value  of  a  diagnostic  test  is  the  ability  to  determine  a  course  of 
treatment  or  predict  treatment  outcomes  through  its  application.37,38  Another  aspect  of 
outcome  efficacy  is  the  cost  effectiveness  of  a  diagnostic  test  in  comparison  to  alternative 
diagnostic  strategies.40 

The  technical  and  diagnostic  levels  of  efficacy  for  tests  and  measures  included  in  this 
study  will  be  assessed.  In  addition,  follow-up  data  collected  at  6  weeks  will  allow  an 
approximation  of  the  outcome  level  of  efficacy  for  the  tests  and  measures  assessed  in  this 
study.  None  of  the  diagnostic  tests  for  CR  considered  in  this  study  and  only  a  few  tests 
for  CTS  have  been  assessed  at  the  therapeutic  or  outcome  level  of  efficacy. 
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2.2.2  Research  Methodology 

The  most  appropriate  research  study  design  for  an  investigation  is  determined  by  the 
question  being  asked.42  For  example,  the  randomized  clinical  trial  is  considered  the 
paragon  for  assessing  the  effectiveness  of  a  treatment.43  Similarly,  optimum 
methodological  principles  have  been  proposed  to  assess  the  efficacy  of  a  diagnostic 
test.37'44'46  There  are  three  basic  considerations  when  assessing  the  diagnostic  properties 
of  a  test.  The  first  is  the  gold  standard  or  reference  criterion  to  which  the  test  in  question 
is  compared.  The  second  is  the  spectrum  of  patients  to  which  the  test  is  administered  or 
applied.  The  third  and  final  consideration  are  the  procedures  used  to  control  bias.  Each 
of  these  considerations  will  be  discussed  below. 

Gold  Standard-  The  gold  standard  serves  as  a  reference  criterion  by  which  properties  of 
the  diagnostic  test  in  question  are  determined.  Although  the  gold  standard  is  more 
accurate  than  the  test  being  compared  to  it,  is  also  usually  more  costly,  more  time 
consuming,  and  involves  more  risk  to  the  patient;38  hence  the  need  to  develop  a  simpler 
and  less  costly  diagnostic  test  that  can  accomplish  the  same  purpose  with  minimal  loss  of 
accuracy.  Procedures  that  define  anatomic  and  physiologic  abnormalities,  including 
surgery,  are  often  used  as  gold  standards.40  Other  less  conventional  gold  standards 
include  expert  clinician  opinion  and  clinical  course  or  outcome.47,48  All  gold  standards, 
no  matter  how  good,  have  some  degree  of  imperfection37,45,49  and  what  constitutes  the 
single  “best”  gold  standard  is  often  the  subject  of  much  debate.40  Resolving  these 
dilemmas  may  depend  on  the  intended  clinical  use  of  the  diagnostic  test  being  assessed 
and  the  best  available  standard  may  often  be  “silver,  bronze,  or  tin”  in  hue  instead  of 
“gold”.40 

Patient  Spectrum-  This  term  refers  to  the  range  of  features  found  in  the  patient  sample 
used  to  challenge  or  assess  the  diagnostic  properties  of  a  test.44  The  pathologic,  clinical, 
and  co-morbid  components  of  the  target  disorder  must  be  considered  when  assembling 
the  patient  sample  that  will  be  used  to  assess  the  diagnostic  test  being  evaluated.  The 
pathologic  component  refers  to  the  extent  of  disease  process,  such  as  localized  versus 
extensive  cancer.  The  clinical  component  refers  to  features  such  as  chronicity  and 
severity  of  symptoms.  The  co-morbid  component  refers  to  co-existing  pathology 
unrelated  to  the  disease  of  interest.  Each  component  may  adversely  affect  the  positive  or 
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negative  diagnostic  accuracy  of  the  test  in  an  unpredictable  fashion,  depending  on  the 
disease  and  diagnostic  test  in  question.  For  example,  a  test  that  performs  well  with 
patients  whose  disease  process  is  mild  may  perform  poorly  with  patients  who’s  disease 
process  is  advanced.50  Patients  who  serve  as  controls  should  have  conditions  with 
pathologic  features  or  similar  signs  and  symptoms  that  might  be  easily  confused  with  the 
disease  of  interest.  Including  these  types  of  patients  as  controls  is  useful  for  assessing  the 
number  of  false  positives  a  test  will  yield  and  thus  provides  a  meaningful  interpretation  of 
test  specificity.44  Almost  any  test  can  distinguish  between  severely  diseased  patients  and 
healthy  control  subjects.  The  true  challenge  of  test  validity  occurs  when  a  study  includes 
control  subjects  that  resemble  the  population  of  patients  to  which  the  diagnostic  test  will 
be  applied  in  clinical  practice.45 

Biases-  For  each  patient,  the  investigator  must  determine  whether  the  diagnostic  test  is 
positive  or  negative  and  if  the  disease  condition  is  present  or  absent.  If  these 
determinations  are  not  independent,  a  false  index  of  test  diagnostic  accuracy  may  result. 
Control  must  be  exerted  for  several  types  of  biases  that  include:  work-up,  diagnostic 
review,  test-review,  and  incorporation.44  Different  synonymous  descriptors  have  been 
used  by  other  authors  to  describe  these  biases.40’45  Work-up  bias  occurs  when  the  result 
of  a  test  affects  the  subsequent  clinical  work-up  needed  to  establish  the  diagnosis  of  the 
target  disorder.  For  example,  a  patient  with  a  negative  test  may  have  a  less  intense  work¬ 
up  or  may  not  even  have  the  gold  standard  procedure  applied  to  them  since  they  are 
thought  to  be  disease  free  based  on  the  results  of  the  test.  This  type  of  bias  can  lead  to 
under  diagnosis  but  not  over  diagnosis.  Diagnostic-review  bias  occurs  when  the  result  of 
the  diagnostic  test  being  assessed  affects  the  determination  of  whether  the  target  disorder 
is  present  or  absent  and  may  result  in  over  diagnosis  as  well  as  under  diagnosis.  Test- 
review  bias  occurs  when  the  presence  or  absence  of  the  target  disorder  is  known  to  be 
established  and  affects  the  subjective  interpretation  of  the  diagnostic  test  being  assessed 
and  can  also  lead  to  over  diagnosis  or  under  diagnosis  as  well.  Incorporation  bias  occurs 
when  the  test  in  question  is  incorporated  into  the  evidence  used  to  establish  the  presence 
of  the  target  disorder.44 
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Other  potential  difficulties  and  issues  to  consider  when  assessing  the  accuracy  of  a 
diagnostic  test  include:  inter-rater  reliability;  whether  the  test  was  performed  singly  or  in 
combination  with  other  tests;  what  metrics  were  used  to  quantify  test  efficacy;  if  the  test 
procedure  was  operationally  defined;  and  if  the  setting  and  population  it  was  applied  to 
were  clearly  defined. 38!40,44?45 

Although  many  of  the  preceding  issues  seem  straightforward  and  intuitive,  it  is  clear  from 
the  literature  that  sound  methodological  criteria  are  often  not  adhered  to  when  assessing 
diagnostic  tests.  Sheps  et.  al.51  reviewed  129  articles  against  7  methodological  criteria 
identified  as  being  important  for  diagnostic  test  research.  Overall,  74%  of  the  studies 
failed  to  adhere  to  more  than  four  of  the  seven  criteria  and  revealed  the  following:  68% 
employed  a  well-defined  gold  standard;  32%  operationally  defined  how  tests  were 
interpreted;  interpretation  of  test  results  was  blinded  in  40%,  approximately  20%  used  the 
terms  sensitivity  and  specificity  incorrectly;  and  the  influence  of  disease  prevalence  and 
practice  setting  were  considered  in  only  19%. 51  A  qualitative  review  of  the  literature 
dealing  with  the  accuracy  of  diagnostic  tests  for  low  back  pain  revealed  major 
methodological  shortcomings  in  most  studies  and  only  19  out  of  36  articles  scored  over 
55  out  of  100  points.  Research  methodology  employed  in  the  development  of 
diagnostic  tests  must  possess  the  same  rigor  currently  required  for  clinical  trials  of 
treatment  effectiveness.  Not  adhering  to  sound  methodological  criteria  may  result  in 
improper  patient  management44  and  a  confounding  of  clinical  treatment  trials  because  of 
an  inability  to  properly  define  the  patient  population  and  assemble  a  homogeneous  patient 
sample.27'53 
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2.2.3  Metrics  and  Interpretation  of  Test  Properties 

Each  component  of  the  clinical  examination  can  be  considered  a  separate  diagnostic 
test.27  Once  the  clinical  examination  is  performed,  the  clinician  interprets  the  findings 
both  individually  and  collectively  in  the  clinical  decision  making  process.  Determining 
the  relevance  of  the  clinical  examination  findings  in  a  meaningful  fashion  requires  three 
mechanisms:  first,  a  means  of  establishing  a  significant  probability  or  association 
between  an  item  or  items  of  the  clinical  examination  and  the  target  disorder;  second,  a 
means  of  determining  how  much  the  result  contributes  to  the  diagnosis  above  and  beyond 
other  clinical  examination  results;  and  third,  a  means  of  determining  if  the  test  results 
indicate  an  increased  or  decreased  chance  of  the  target  disorder  being  present,  beyond 
that  expected  prior  to  testing. 23  24pp120  125  Three  types  of  metrics  used  to  determine  the 
relevance  of  the  clinical  examination  findings  have  been  described  and  will  be  discussed 
below.24pp'69'1 5254,55 


Sensitivity  and  Specificity-  Test  sensitivity  (Sn)  and  specificity  (Sp)  are  conditional 
probabilities  that  can  be  used  to  define  the  informational  contribution  of  a  test.  Test 
Sn  is  defined  as  the  probability  of  obtaining  a  positive  test  result  when  the  target  disorder 
is  present.  Likewise,  test  Sp  is  defined  as  the  probability  of  obtaining  a  negative  test 
result  when  the  target  disorder  is  absent.24pp81‘82  Sensitivity  and  Sp  calculations  are 
illustrated  in  Figure  1 . 
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Diagnostic 
Test  Result 


Target  Disorder 
Present  (+)  Absent  (-) 


Positive  (+) 

a 

i 

b 

a/a+b=PPV 

Negative  (-) 

c 

d 

d/c+d=NPV 

a 

d 

a+b+c+d=  N  (total) 

Sn=  a  +  c 

Sp=  b  +  d 

Figure  1  Sensitivity  and  Specificity 


desirable  for  a  test  to  have  both  high  So  and  Sp,  factors  .ha,  contnbute  to  unprov, 
respective  proportion  often  mitigate  the  other “  A  single  test  that  results  ,n  a 
dichotomy  (present/absent,  positive/negative)  will  have  only  one  Sn  and  Sp  value.  Tes  s 
that  produce  ordinal  or  continues  results  have  many  possible  Sn  and  Sp  v  ues,  ^ 
depending  upon  the  threshold  criteria  chosen  to  define  a  positive  or  negat.ve  test. 

ZJ,  -  ^  -  -  -  a—  .  —  ' — > 

combining  the  results  of  two  or  more  dicho.om.es  and  treating  this  cluster  as  a  single 
diagnostic  test.  To  increase  specific, y,  for  example,  two  out  of  three  tests  may  e 
required  to  be  positive  in  order  for  the  single  test  cluster  to  be  considered  a  positive 

diagnostic  test.  The  same  procedure  could  be  used  in  a  similar  but  opposite  as  on  o^ 

increase  test  sensitivity  by  minimizing  the  requirements  for  a  positive  test  c  uster. 
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The  threshold  criteria  is  established  depends  on  which  test  property  is  more  ^able^Sn 
or  Sp)  Whether  Sn  or  Sp  is  desired  depends  on  both  the  rnten  e  purp 

or  ,*  24pp-99  for  example,  a  test  used 

screen  or  diagnose)  and  the  consequences  of  intervention. 

screen  for  cancer  should  be  highly  sensitive  in  order  to  prevent  a  case  torn  g 
missed  Specificity  may  be  sacnficed  in  order  to  increase  test  sensitivity  because 
consequences  of  a  missed  case  are  disastrous  compared  to  the  cost  and  d.scomfort 
subsequent  work-up  for  patients  who  have  a  false  positive  test  finding.  Likewise, 
used  for  the  diagnosis  of  a  target  disorder  should  be  highly  specific  ,f  surgrcal 
intervention  is  based  in  whole  or  in  part  on  the  result  of  the  test.  In  this  case  some 
sensitivity  will  likely  be  sacrificed  in  order  to  increase  test  specificity  because  e 
consequences  of  a  false  negative  finding  may  be  only  minima,  when  compared 
increased  morbidity  associated  with  a  false  positive  test.  pp 

Predictive  vame,  unfortunately,  Sn  and  Sp  can  be  evaluated  only  if, he  tnre  health 
status  of  the  patient  is  known.  In  practice,  the  clinician  rarely  knows  a-pnon  ,f  the  ta  g 
condition  is  present  in  the  patient  he  or  she  is  evaluating,  otherwise  the  diagnostic  test 
would  be  uimecessary.  Therefore,  Sn  and  Sp  are  of  limited  value 
probability  of  whether  a  patient  is  more  likely  or  less  likely  to  have  t  e  targ 
P  n  I.  of  the  test J!pp  ss  The  real  question  of  interest  that  must  be  answered  is 

based  on  the  result  of  the  test.  inereaq  disease’”" 

...f  a  patient  has  a  positive  or  negative  test,  how  likely  is  he  or  she  to  have  th  disea^ 

One  method  of  determining  this  probability  is  the  calculation  of  pre  ictive  va  ues 
positive  predictive  value  (PPV)  measures  the  pre-test  probability  that  a  patient  ac  ua^^ 
has  the  target  disorder  when  the  test  is  positive.  Likewise,  the  negative  pre  ic  ive 
(NPV)  measures  the  pre-.es,  probability  that  the  patient  does  no.  have  the  target  isor  er 

H  » i, .*» I-— >  -  f  r:  l 

i--  >••*>»"  i-"“  •" ,i* " 

45 

groups. 

Calculation  of  PPV  and  NPV  is  illustrated  in  Figure  1.  Like  Sn  and  Sp, ^predict™ i  vates 
are  of  limited  clinical  use  but  for  a  different  reason;  they  are  calculated  from  left  ng 
in  the  2  X  2  contingency  table  and  are  therefore  dependent  upon  disease  prevalence 
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which  makes  them  unstable.  Regardless  of  the  test’s  Sn  or  Sp  pr°pert*es> 
falls  the  PPV  must  fall  along  with  it  and  the  NPV  must  nse  Likew,se,  when  p 

rises  so  does  the  PPVwWle  the  NPVfalls^ 88  The  dependence  upon  disease 

prevalence  and  the  unstable  nature  of  positive  and  negative  predictive  values  is  i  u 
LTgures  2  and  3  on  the  following  pages.  In  the  past,  predidve  values  for  an  estimated 
prevalence  rate  of  50%  were  often  given  as  a  standard  characteristic  for  a  test.  Because 
predictive  values  are  prevalence  dependent,  they  are  useless  in  other  settings  w  ere 
prevalence  or  pretest  probability  of  the  disorder  is  different  «  Therefore,  chmcans  must 
match  a  patient's  history  specific  prevalence  to  the  Sn/Sp  values  of  a  given  test  in  or 
I  II  derive  clinically  meaning,,  predictive  values  *  which  can  be  guite  cumbersome 

if  not  impractical  in  a  clinical  setting. 
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Prevalence.  50% 
Present  (+)  Absent  (-) 


(Sn=  .90)  (Sp-  -80) 


no 


(PPV=.82  (a/a+  b)i 


/p^pv=  .89  (a/a  +  b)  ) 
90 


200  =  N  (total) 


Figure  2  Predicitive  Values,  50%  Prevalence 

Prevalence:  5% 

Present  (+)  Absent  (-) 


Positive  (+) 

Diagnostic 

9 

38 

Test  Result 

a 

b 

Negative  (-) 

r 

d 

i 

152 

100  100 


47 


(PPV=.19  (a/a  +  b» 


153 


(NPV=  .99  (a/a  +  b)  ) 


200  =  N  (total) 


(Sn=  .90)  (Sp-  .80) 


Figure  3  Predicitive  Values,  5%  Prevalence 
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■  ikelihoad  Ratios-  Use  of  a  likelihood  ratio  (LR)  is  another  method  for  determining  the 
probability  of  whether  a  patient  is  more  likely  or  less  likely  to  have  the  target  condition 
based  on  the  result  of  the  diagnostic  test.  The  concept  of  a  LR  has  been  advocated  as  a 
better  means  for  assessing  the  properties  of  a  diagnostic  test  and  as  a  practical,  va  ua  e 
tool  for  clinical  decision  making  M  An  LR  is  a  ratio  of  two  probabilities  that  expresses 
the  odds  that  a  given  level  of  a  diagnostic  test  result  (positive  or  “0U“  6 

expected  in  a  diseased  patient  compared  with  anon-diseased  patient  an  is 

illustrated  below. 


Probability  of  test  outcome  given  diseased  patients 
LR=  - “ 

Probability  of  test  outcome  given  non-diseased  patients 


When  an  LR  exceeds  1,  the  odds  favoring  a  disease  increases;  when  the  LR  becomes  ess 
than  1 ,  the  odds  favoring  the  disease  decrease;  and  when  an  LR  approaches  1 ,  the  o 
favoring  a  disease  do  not  change  and  the  test  is  indeterminate.  Positive  (LR+)  and 
negative  (LR-)  LR’s  algebraically  combine  Sn  and  Sp  to  describe  more  than  the 
independent  values  themselves;”  they  summarize  the  information  of  both  Sn  and  Sp  an 
thereby  represent  the  discriminative  power  of  a  test.  Positive  and  negative  LR  s  are 
computed  in  the  following  manner:54 


LR+=  Sn/(1-Sp) 


LR—(1-Sn)/Sp 


The  following  example  based  on  a  study  by  Fntz  et.  al.  is  helpful  for  illustrating  th 
interpretation  ofLR’s: 

A  treadmill  walking  test  (longer  walking  time  during  inclined  walking)  is  used  to 
diagnose  patients  suspected  of  having  lumbar  spinal  stenosis.  The  treadmill  test  LR+ 
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6.49  and  the  LR-=  .54.  Tins  means  that  a  positive  treadnnll  test  is  6.49  tunes  more  likely 
to  occur  in  patients  with  lumbar  spinal  stenosis  than  from  those  without  lumbar  spina 
stenosis.  Similarly,  a  negative  treadmill  test  is  only  .54  times  as  likely  to  occur  in 
patients  with  lumbar  spinal  stenosis  than  from  those  without  lumbar  spi 


Several  authors  have  described  three  important  properties  or  advantages  of 

jj^,^.24.54.55pp  120-123 

1  Likelihood  ratios  are  stable.  Because  they  are  calculated  vertically  in  the  -  X 
contingency  table,  LR’s  do  not  change  with  changes  in  the  prevalence  or  pretest 

probability  of  the  target  disorder. 

2  Likelihood  ratios  may  be  established  for  multiple  levels  of  test  outcome. 
Establishing  multiple  level  LR’s  improves  their  diagnostic  properties  for  test 
results  that  are  ordinal  or  continuous  scaled. 

3  Likelihood  ratios  allow  a  clinician  to  immediately  assess  the  impact  of  a  test 
result  on  the  post.es,  probability  that  a  patient  will  have  the  dtsease  of  .merest  and 
can  guide  sequential  testing.  If  the  pretest  prevalence  (or  probabthty)  of  a  d.sease 
is  known  or  can  be  estimated,  the  posttes,  probability  of  the  disease  bemg  present 
can  be  calculated  using  the  formula  below  which  is  be  derived  from  Bayes 

theorem. 


Pretest  odds  *  Likelihood  Ratio-  Posttest 


Odds  for  the  Target  Disorder 


Where:  Prevalence/(1  -  Prevalence)-  Pretest  Odds 

Because  clinicians  may  be  more  comfortable  with  probabilities  than  odds,  the  posttes, 
odds  may  be  converted  back  to  a  probability  in  the  following  manner  . 


Posttest  Odds/(l  +  Posttest  Odds)=  Posttes.  Probability 
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Once  again,  the  spinal  stenosis  example  from  Fntz  et.  al.  is  helpful  for  demonstrating 
how  the  LR  of  a  test  (LR+  in  this  case)  may  be  used  to  change  the  probability  estimate 

■  24pp.  123- 

for  the  presence  or  absence  of  a  disorder  in  a  given  patien  . 


126 


Three  disadvantages  of  LR’s  have  also  been  reported  and  include  the  following, 

, .  Knowledge  of  a  test's  Sn  and  Sp  is  still  required.  Because  the  same  LR  can  be 
the  result  of  the  combination  of  very  different  Sn  and  Sp  values,  the  Sn/Sp  of  a 
test  must  be  known  when  false  positives  or  false  negatives  are  to  be  avoided  as 

much  as  possible. 
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2.  The  posttest  probabilit.es  generated  by  LR  values  are  no.  linear,  the 
discriminative  strength  ti  e  resultant  posttest  probabilities)  of  an  LR+  value  o 

is  not  ten  times  that  ofanLR+ value  of  10,  j 

3.  The  precision  of  LR’s  depends  on  the  proportion  of  diseased  and  non-disease 
subjects.  The  confidence  interval  around  an  LR  becomes  progressively  wider  as 
the  imbalance  between  diseased  and  non-diseased  subjects  increases. 


Another  potential  disadvantage  is  the  burden  for  clinician  to  establish  posttest 
probabilities.  The  need  to  convert  back  and  forth  between  pretest  probabihty/pretes,  odds 
and  post.es.  odds/posttest  probability  can  be  confiising  and  somewhat  time  consuming. 
However  this  problem  is  easily  remediedwith  the  use  of  Fagan’s  nomogram  (Figure 
4)  »  Once  the  prevalence  of  a  disorder  has  been  estimated  and  the  LR’s  of  a  given  test 
L  known,  the  posttes.  probability  can  be  deterged  by  using  a  ruler  and  the  nomogram. 


99 
PRETEST 
PROBABILITY 


likelihooo 

RATIO 


posttest 

probability 


Figure  4  Fagan’s  Nomogram 
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Since  LR’s  refer  to  actual  test  results  before  disease  status  is  known,  they  are 
immediately  more  useful  to  clinicians  than  Sn  or  Sp.55  Although  the  predictive  use  of 
LR’s  has  limitations,  LR’s  represent  a  distinct  advantage  over  the  traditional  use  of  PPV 
and  NPV.  Likelihood  ratios  (LR+/LR-)  and  their  respective  95%  confidence  intervals 
will  be  calculated  for  each  diagnostic  test  and  diagnostic  test  cluster  assessed  in  this 

study. 

2.3  Reference  Criteria  or  "Gold  Standards" 

The  traditional  medical  model  of  disease  proposes  that  all  disease  may  be  defined  y 
deviations  from  pathophysiologic  norms  and  that  the  underlying  cause  of  disease  must  be 
identified  before  appropriate  corrective  measures,  in  the  form  of  treatment,  can 
implemented.”  Indeed,  Taylor  has  stated  that  the  current  understanding  of  this  model 
has  come  to  be  strictly  associated  with  the  identification  of  structural  abnormality  rather 
than  referring  to  clinical  or  etiological  events  “  Although  the  simplicity  of  this  model  is 
attractive,  it  is  well  known  that  symptoms  and  pathology  are  not  always  strongly 
correlated  in  a  number  of  conditions. 


Cervical  radiculopathy  is  a  condition  in  which  the  nerve  root  is  insulted  and  typically 

results  in  pain,  disturbance  of  function,  and  may  often  be  accompanied  by  a  variety  of 

,  -  .  ,  .  ,  64pp. 537-539  Therefore  CR  is  subsumed  very  well 

anatomic  and  pathophysiologic  changes.0^  1  heretore,  ^ 

by  the  traditional  medical  model.  It  does  not  appear,  however,  that  carpal  tunnel 

syndrome  is  as  well  accounted  for  by  this  model  as  is  CR.  The  term  “syndrome  is 

defined  as  “a  concurrence  of  symptoms”  or  “the  aggregate  of  signs  and  symptoms  ^  ^ 

associated  with  any  morbid  process  and  constituting  together  the  picture  of  the  disease”. 

Accordingly,  a  cluster  of  signs  and  symptoms  may  not  necessarily  be  attributable  to  a 

distinct  anatomical  abnormality.  Despite  the  connotations  of  the  term  “syndrome”,  the 

signs  and  symptoms  of  CTS  are  attributable  to  compression  of  the  median  netve  in  the 

carpal  canal.66  Therefore,  CTS  may  also  be  identified  by  a  pathophsyiologic  abnormality 

of  the  median  nerve  in  a  majority  of  patients6’  Since  both  of  the  conditions  of  interest  in 

this  study  may  be  defined  on  the  basis  of  pathophysiological  abnormalities,  the  ideal 

reference  criterion  (referred  to  hereafter  as  “gold  standard”)  used  to  assess  the  efficacy  of 
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diagnostic  test  procedures  is  one  capable  of  detecting  pathophysiologic  a 
wl  may  he  singularly  applied  to  a  patient  suspected  «  e 

EMG/NCS  examination  that  includes  needle  electromyography^(EMG)  ^  peered  upper 
conduction  studies  (NCS)  of  selected  muscles  and  new  patients 

quarter  meets  this  criteria  and  ,s  commonly  used  md.  of 

suspected  of  having  either  CR  or  CIS.  A  „0  participa.e  i„  this 

,he  symptomatic  upper  quarter  will  be  administered  to  a  P^  ^ 

study. .  Abnormalities  of  the  EMG/NCS  examination  will  serv  as  g 

the  diagnosis  of  both  conditions.  The  diagnostic  properties  of  the  EM 

examination  will  now  be  discussed  separately  for  each  condition  and  a  discussion 
acceptable  gold  standards  for  patient  outcome  will  follow. 

2US*D****«**  f  the  EMG/NCS  exam  in  patients  with  CR  is  to 

The  purpose  of  the  EMG  component  of  the  EMG/N  h  t0  a  cervical 

detect  neural  of  the  EMG/NCS  exam 

distal  mononeuropathy  64pp541  muscles  as  well  as  the 

The  tvoical  EMG  examination  consists  of  assessing  severa 

The  typical  EMG  muscles  sampled  each  represent  the 

— — -»■— r 
rr ■  - -  -  *  -  r; 

individual  muscles  consists  of  two  main  steps.  First,  elec  ^  ^ 

a,  several  depths  and  in  various  directions  m  a  given  ™“e  m 

i—i .„ .  - — ■  -  ■ - — —  -r—rr 

primarily  in  the  form  of  fibrillations.  Fibnllations  and  ot  ftef  ,4.21  days 

forms  of  abnormal  spontaneous  activity  occur  in  deinnervate 
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and  are  due  to  the  development  of  acetylcholine  hypersensitivity  by  the  muscle 
membrane,  winch  renders  it  unstable.68'69  The  second  step  is  to  have  the  patient 
voluntarily  contract  the  muscle  being  examined  which  will  elicit  motor  unit  action 
potentials  (MUAP).  The  morphology  of  the  MUAP  (amplitude,  duration,  and  the  number 
of  phases)  and  recruitment  pattern  are  then  assessed  for  abnormalities.  Typical 

or  standard  NCS  procedures  used  in  the  examination  of  patients  with  suspected  CR  are 
described  below  in  the  CTS  section.  The  findings  of  the  EMG  and  NCS  examination  are 
then  integrated  and  if  abnormalities  are  present  and  consistent  with  a  lesion  of  the 
cervical  root,  then  the  diagnosis  of  CR  is  established. 

Lacking  a  better  method  for  detecting  nerve  root  pathophysiology,  investigators  have 
attempted  to  establish  the  sensitivity  and  specificity  of  EMG  by  comparing  it  to  other 
pathoanatomic  procedures  used  in  the  evaluation  and  diagnosis  of  patients  with  cervical 
radiculopathy.  These  procedures  include  imaging  studies  (myelogram  and 
CT/myelogram72)  and  surgical  observation70’73-75  Because  of  the  difference  in  purpose  of 
these  procedures,  the  use  of  a  pathoanatomic  gold  standard  to  determine  the  diagnostic 
accuracy  of  a  procedure  such  as  EMG,  which  defines  abnormalities  based  on 
pathophysiology,  is  invalid  .6'64pp554  Instead,  any  comparison  of  the  two  procedures 
should  merely  be  interpreted  as  a  correlation  or  percent  agreement  and  a  degree  of 
divergence  would  be  expected.  Even  the  use  of  surgical  observation  as  a  reference 
criterion  for  studies  of  EMG  diagnostic  accuracy  is  problematic  and  precludes  the 
establishment  of  specificity  because  it  could  not  feasibly  or  ethically  be  applied  to  the 
entire  patient  sample.40  Determining  the  diagnostic  accuracy  of  any  test  for  spinal 
disorders  is  problematic  due  to  the  difficulty  of  establishing  a  suitable  reference  criterion 
or  gold  standard.40  The  diagnostic  accuracy  of  needle  EMG  for  cervical  radiculopathy  is 
no  exception  and  depends  upon  the  clinical  parameter  or  reference  criterion  which  is 

«  40.76 

chosen. 

Needle  electromyography  is  the  oldest  electrophysiologic  examination  procedure  for  the 
diagnosts  of  radiculopathy.73  The  percent  agreement  between  positive  EMG  findings  and 
surgically  observed  abnormalities  in  patients  with  CR  ranges  from  54%  (95%  confidence 
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interval  (950)-  44%-64%)75  to  100%  (95%CI=  85%-100%)  ,  s.mtlar  observa  tons 
been  repotted  for  patients  with  luntbo-sacra,  radiculopathres  (78%-90%)”'7  '  However, 
the  interpretation  of  EMG  findings  and  surgically  observed  abnormal,., es  ,n  pat.ents  w„ 
radiculopathy  is  somewhat  confounding  because  some  authors  do  not  specify  the 
used  to  determine  the  presence  of  an  abnormality:  a  herniated  nucleus  pulposus 
an  irritated  neive  root.”'75  ”  A  diagnosis  of  CR  based  on  the  mere  observation  o  an 
HNP  is  inappropriate  45  Boden  et.  al.  found  herniations  of  the  cervical  intervene  ra  ,sc 
to  be  present  in  18%  of  63  asymptomatic  volunteers7’  and  an  even  larger  percentage  o^ 
false  positive  findings  in  the  lumbar  spine  have  been  reported  by  numerous  aut  ors. 
Wilboum's  statement  that  needle  EMG  is  nearly  100%  specific  for  the  examination  o 
patterns  with  suspected  radiculopathy*4  cannot  be  substantiated  due  to  the  methodology 
limitations  mentioned  previously,  but  no  one  has  reported  a  false  positive  EM  ^ 
patients  treated  surgically  for  CR;70  ”'7!  the  same  cannot  be  said  for  imaging  stu  ,es. 

Myelography  is  the  imaging  procedure  EMG  has  been  most  frequently  compared  with. 

The  correlation  between  myelography  and  EMG  in  patients  with  radmulopathy  ,s 

consistently  high  for  both  the  cervical  (75%  <95SO-  61%-95%)  and  lumbosacral 

(90%  (95%C1=  77%-100%))*6  regions  and  a  complementary  relationship  between  e 

L  procedures  for  the  diagnosis  of  radiculopathy  has  been  acknowledged  in  aU  identified 
**  i  _ l. .  thp  qhilitv  to  detect 
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reports.77'*6'’1  The  advantages  of  EMG  versus  myelography  include,  the  ability  to 
lateral  root  entrapment;76  detect  insidious  disease  processes;75  and  no.  injecting  foreign 
material  into  the  body64”'554  Shared  disadvantages  are  that  both  procedures  are  invasive 
a„d  involve  various  degrees  of  discomfort.  A  high  percent  agreement  has  also  been 
reported  when  EMG  is  compared  to  CT  in  patients  with  lumbosacral  radiculopa.  es 
(85%77  to  89%  (95%C1=  80-98%)’*).  In  the  only  study  in  which  dam  were  statistic,  y 
analyzed,  EMG  was  found  to  be  superior  over  CT  (P<  0001)  and  the  clinical  exam  for 
detecting  which  lumbosacral  nerve  root  was  involved.77  The  only  study  to  compare  EMG 
with  CT  in  patients  with  suspected  CR  found  an  agreement  of  67%  for  the  two 
procedures  (95%CI=  4l%-93%)”  Although  the  use  of  non-invas,ve  imaging  techniques 
such  as  CT  and  MRI  for  diagnosis  of  radiculopathy  is  appealing,  both  procedures  support 
pathoanatomic  diagnoses  in  10%  to  30%  of  the  asymptomatic  population  depending  on 
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age  79-83,94,95  This  j$  of  concem  because  surgical  intervention  for  patients  with  CR  may 
often  based  on  positive  test  results.  Given  the  potential  complications  associated  with 
surgery,  the  low  morbidity  associated  with  untreated  CR,  and  the  fact  that  prognosis  for 
recovery  is  good  in  the  majority  of  cases,96'100  it  can  be  argued  that  a  diagnostic 
procedure  which  yields  few  false  positive  findings  (i.e.  is  highly  specific)  is  warranted  for 
the  diagnosis  of  patients  with  suspected  CR. 101 

Other  electrophysiologic  examination  procedures  for  the  diagnosis  of  radiculopathy  have 
been  advocated  in  an  attempt  to  increase  the  sensitivity  of  EMG  and  include  an  analysis 
of  motor  unit  action  potentials  (MUAP)  and  the  evaluation  of  evoked  potential  latencies 
(flexor  carpi  radialis  (FCR)  H-reflex,  median  and  ulnar  F-waves,  and  dermatomal 
somatosensory  evoked  potentials  (DSEP))76  Although  these  additional  procedures  may 
increase  the  yield  of  abnormalities  detected  during  the  electrophysiologic 
examination,75  93402  muscle  membrane  instability  observed  during  needle 
electromyography  is  still  considered  the  hallmark  diagnostic  sign  and  the  single  most 
sensitive  pathophysiologic  method  for  establishing  the  diagnosis  of  both  lumbar  and 

.  ,  73,75.77.78.103 

cervical  radiculopathy. 

There  is  some  evidence  that  EMG  may  be  useful  in  predicting  the  outcome  of  patients 
with  radiculopathy.  Two  studies  have  reported  that  patients  with  normal  pre-operative 
EMG  findings  have  poorer  surgical  outcomes  as  expressed  by  symptom  relief  or 
measured  pain  intensity  compared  to  patients  in  whom  pre-operative  EMG  abnormalities 
were  observed  (p<01).104  One  study  reported  that  patients  with  an  abnormal  pre¬ 
operative  FCR  H-reflex  had  a  better  clinical  outcome  at  two  years  (p<  03)  than  did 
patients  who  had  a  normal  pre-operative  H-flex;  a  similar  relationship  was  not  observed 
for  needle  EMG  findings.70  Despite  these  reports,  the  relationship  between  EMG  and 
patient  outcome  is  still  inconclusive  because  some  studies  used  non- standardized 
outcome  instruments  with  unknown  psychometric  properties,70  75  the  number  of  subjects 
or  cell  sizes  were  limited,104  and  data  were  not  analyzed  statistically.75 
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2  3  2  Carnal  Tunnel  Syndrome 

Because  NCS  and  needle  EMG  provide  a  unique  way  of  directly  assessing  the  integrity  of 
sensory  and  motor  nerve  fibers,  they  have  become  the  mainstay  for  the  laboratory 
evaluation  of  CTS.64p1396  The  primary  purpose  of  the  EMG/NCS  exam  in  patients  with 
suspected  CTS  is  to  detect  and  localize  abnormalities  to  the  distribution  of  the  median 
nerve.  Additional  purposes  may  be  to  rule-out  other  causes  of  symptoms  such  as  a 
diffuse  peripheral  neuropathy,  a  more  proximal  mononeuropathy,  and  in  some  cases  rule 
out  a  concomitant  CR.64pp869'875'6  Usually  one  or  more  median  innervated  muscles  is 
examined  with  needle  EMG  as  well  as  a  radial  or  ulnar  innervated  muscle  for 
comparison,  except  in  cases  of  suspected  concomitant  CR  when  more  comprehensive 
muscle  sampling  is  performed. 64pp873  The  procedure  for  the  EMG  examination  was 
described  in  section  2.3.1.  In  a  typical  NCS  examination  of  a  patient  with  suspected 
CTS,  both  the  sensory  and  motor  components  of  the  median  nerve  are  assessed.  Surface 
electrodes  are  placed  on  the  wrist  or  fingers  to  record  evoked  potentials  when  nerve 
stimulation  at  the  wrist  or  elbow  occurs.  Alternatively,  recording  electrodes  may  be 
placed  over  the  nerve  at  the  wrist  to  record  evoked  potentials  when  the  digit  or  palm  is 
stimulated.  For  comparison,  the  ulnar  nerve  is  examined  in  a  similar  fashion  although  the 
radial  nerve  may  also  be  used.105  The  latency  and  amplitude  of  the  evoked  potentials  are 
the  most  commonly  assessed  NCS  parameters.  The  findings  of  the  EMG  and  NCS 
examination  are  integrated  and  abnormalities  of  latency,  amplitude  and  muscle 
membrane  stability,  when  present  and  isolated  to  the  median  nerve  distal  to  the  wrist, 

help  establish  the  diagnosis  of  CTS.66 

In  1956,  Simpson  was  the  first  to  report  the  usefulness  of  median  motor  nerve  conduction 
studies  in  the  diagnosis  of  CTS.106  His  observations  were  later  validated  by  a  number  of 
other  investigators107108  and  assessment  of  the  sensory  component  of  the  median  nerve 
was  also  included  as  advances  in  technology  made  this  feasible.109  Using  intraoperative 
NCS,  Brown  confirmed  that  nerve  conduction  abnormalities  of  the  median  nerve  in 
patients  with  CTS  were  localized  to  the  area  under  the  transverse  carpal  ligament. 
Fullerton  suggested  that  two  mechanisms  were  responsible  for  the  signs  and  symptoms  of 
CTS:  one  is  a  rapidly  reversible  change  in  the  nerve  fiber  associated  with  episodes  of 
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ischemia  and  the  other  is  due  to  slowly  developing  structural  changes  in  the  nerve  fibers 
due  to  compression  of  the  median  nerve  under  the  transverse  carpal  ligament. 


Clinical  investigators  have  developed  and  refined  a  variety  of  techniques  in  order  to 
maximize  the  sensitivity  and  specificity  of  NCS/EMG  procedures  for  the  diagnosis  of 
CTS.  These  techniques  include  but  are  not  limited  to:67,112  comparisons  of  latencies 
(bilateral  median  nerves  and  median  nerve  with  ipsilateral  ulnar  and  radial  nerves),  short 
segment  mixed  nerve  latencies;  sequential  short  segment  (1cm)  latencies;  and  comparison 
of  nerve  conduction  velocity  (NCV)  across  the  carpal  tunnel  with  NCV  of  a  finger  or 
forearm  segment.47,113’114  The  reported  specificity  of  sensory  NCS  is  excellent.  An 
assessment  of  long,  short,  and  comparative  sensory  techniques  as  well  as  distal  motor 
latency  NCS  in  several  large  series  of  patients  (n=  100-3  00)  suspected  to  have  CTS  is 
>  95  (  95%CI=  .95-1 .0). 1 15"1 17  The  sensitivity  ofNCS  is  lower  and  varies  depending  on 
the  technique  used:  standard  sensory  conduction  techniques  range  from  .49  to 
,84117while  short  segment,  mixed  nerve  conduction  techniques;  and  techniques  that 
compare  the  ipsilateral  median  and  ulnar  nerve  range  from  ,69115  to  .84. 

Two  recent  reports47,113  show  that  the  ratio  of  the  NCV’s  across  the  carpal  tunnel  and 
NCV  of  either  the  forearm  or  digit  is  both  sensitive  and  specific  for  the  diagnosis  of  CTS. 
These  later  two  works  support  the  earlier  findings  by  Kimura  and  Ayyar  who  tested  814 
limbs  and  found  slowing  of  the  sensory  NCV  across  the  carpal  tunnel  relative  to  the 
forearm  in  100%  of  CTS  patients  but  not  in  any  of  the  asymptomatic  control  subjects. 

In  a  sample  of  50  hands  with  clinically  confirmed  mild  to  moderate  CTS  and  40  normal 
controls,  Padua  et.  al.  computed  the  ratio  of  the  NCV  in  the  3rd  digit  and  the  nerve 
conduction  velocity  across  the  carpal  tunnel.  This  ratio  was  called  the  distoproximal  ratio 
and  reported  a  sensitivity  of  .98  (95%CI=  94-1.0)  and  a  specificity  of  >  .95  (95%CI=  95- 
100%)  for  the  procedure.113  Gunnarsson  et.  al.  evaluated  the  diagnostic  accuracy  of  a 
similar  NCS  technique  in  which  the  NCV  across  the  carpal  tunnel  and  the  proximal  NCV 
in  the  forearm  is  used  to  compute  a  NCV  ratio.  This  ratio  was  obtained  in  169  hands 
referred  for  neurophysiologic  evaluation  of  CTS.  The  diagnosis  of  CTS  was  then 
retrospectively  established  three  months  later  by  using  a  combination  gold  standard  of  a 


-> 

were  required  to  establish  the  diagnosis  of  CTS.  An  were  m 

j  determine  the  optimal  sensitivity  and  spec.fic.ty  values  w 
curve  was  used  to  determi  p  4,  lhod  reported  by 

(95%CI=  68-88)  and.87  (95%CI=  .76-.94),  respectively.  Theme  P 

Padua  et.  al.  will  be  used  in  tins  study  due  to  its  utility  and  ease  of  pe  o 

^houghNCS/EMGproceduresaretheomy  method 

5tatus  of  neural  elements  (axon  and  myelin)  in  peripheral  n^^  m 

obtained  relief  after  CTS  surgery  despite  havmg  a  norma  thirty.,wo 

Grundberg  performed  cmpal  tunnel  release  surgery '» ^  ^ 

of  whom  were  operated  on  despite  normal  nerve  conduct  ^  ^  median 

following:  thirty  one  of  the  thirty-two  pat.ents  experience - 
nerve  abnormalities  were  found  in  22  of  these  patients;  ffCS 

in  8  and  moderate  compression  was  noted  in  one  subject.  »  **  ’ 

abn— shavebeenobservedinas^tomaticsubjec,^^, 

,  H9  Most  stucjies  reporting  on  the  talse  negauv 

techniques  are  performed.  provided  a 

■  frTs  have  either  been  retrospective,  have  nor  p 
EMG/NCS  in  the  dtagnos.s  of  CTS  have  e  or  treatme„t 

valid  or  unbiased  reference  entenon  >.e.  go  n  clinical  opinion  and 

outcome),  or  used  less  sensitive,  long-segmen  determining  the  diagnostic 

the  clinical  exam.nat.on  have  also  served  as  gold  standavd  fcrdem™  ^  ^  ^ 

f  xjpc  unt  the  validity  and  reliability  of  these 
propert.es  of  NCS  but  the  validity  canal  has  als0  been 

established.  Magnetic-resonance  imaging  (MRI)  redictive  value  has  yet  to  be 

considered  for  the  diagnosis  of  CTS  but  its  diagnostic  and  pred.ct.ve 
determined.120 

Vh  CR.  there  is  some  evidence  that  NCS  procedures  may  be  useful 

foTpredicting  outcome  in  patients  with  CTS  who  have  b«n  treated 
patients  with  surgically  treated  CTS:  it  is  limited;  the  majority  o 
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retrospective;111'15  few  use  standardized  outcome  The 

and  approve  statist, ea,  analyses  —  ^ 

one  repo*  that  underwent  a  second  operatton  for  CIS 

that  assessed  the  outcome  of  131  patterns  patients  undenvent  a 

to  persistent  or  recurrent  tabling  symptoms,  re  ^  ^  an(J  Wome„’s  Hospital 

standardized  EMG/NCS  e— on  and  comp  e  ^  ^  NCS  results  were 

Hand  Symptom  Seventy  Scale  )  patjents  wjth 

^  ”,r::,;sss 

Arative  NCS  abnonnalities  had  significantly 
preoperative  in^  egg  scores  were 

significantly  improved  at  a  p  value  o  oreoperative  NCS  and  improved 

quality  have  reported  a  relatiomhip  betwee^J  ^  patients  self-report 

post-surgical  outcomes  as  measured  by  «  ^  ^  w  patjent 

instruments,113 111  impairment  measures. 


121.122.124 


.  .  ,24.125.127,129 

but  conflicting  reports  also  exist. 


opinion 

,  ..  rR  ancj  CTS  and  is  a  chief  concern 

In  summary,  neural  impairment  charactenzes  13W  NCS/EMG 

when  managing  patients  with  etther  °f  <“  °  of  CR  or  CTS,  they  are 

procedures  are  not  100%  sensmve  or  spec.fi  00„dition,  While 

st n:«- 

— *1 » „<»CS  ™ 

objective  diagnostic  test  continues  to  be  an  in  bo.h 

a  c  are  the  only  way  of  assessing  physiologic  neural  impa 
procedures  are  th  y  ,  CTS  will  be  based  on  the 

,  •  64pp  554  In  this  study,  the  diagnosis  of  CR  a 
conditions.  In y’  examinatio„  and  opimon  of  the  EMG/NCS 

presence  of  abnormalities  of  the  EM  rablished  the  provider  will  classify 

the  patient  according  to  seventy  o  used  in  this  study  are  modifications 

pp  75  .77  The  CR  and  CTS  classification  schemes  used 
Of  similar,  previously  published  classification  systems 
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2iiJ*!!Lei!1S2!n£  .  •  hl„  of  imerest  will  serve  as  a  gold  standard  for 

When  deciding  what  variable  or  vana  type  of  instruments 

palien,  outcome,  the  level  of  drsablement,  level  of  outcome,  and  the  P 

oMneasures  to  be  used  are  all  issues  that  must  be  carefully  considered. 

that  ,s  ..near  in  nature However,  numerous  stud  les  r^  <  ^  rf 

of  med, cal  care  provrde  clear  evidence  that  the  The 

disablement  depicted  in  Nagi’s  model  are  not  a  ways  d,  ec  ^  ^  „ 

presence  of  disease  and  impainnent  “  ^ pfLuonal  Umitations  or  disability 
disability  and  proporttomd  change ^  same  relatlonship  between  the 

are  not  always  observed.  fi,nctional  limitation  and/or 

levels  of  disablement  is  also  apparent  m  patten  s  w  rtional  to  the 

disability  ,  disease  and  impatrment  may  be  ^sent  or  ^  ^  ^ 

severity  of  functional  limitation  and  dtsab  y  ^  variabies  should  be 

levels  of  disablement  has  implications  for  dec,  mg  w  ^  care  ^ 

assessed  when  monitoring  patient  response  to  treatment 

settings. 

he  directed  at  different  levels  of  disablement  and  may  be 

Treatment  interventron  may  A  conceptual  framework  depicting  the 

assessed  at  a  variety  of  outcome  ev  .  ha$  been  proposed  by 

relattonship  between  level  of  interventron  and  level  of  ^ 

Whyte  which  was  adapted  to  fit  Nagi’s  dtsablement  model.  (F  gu 
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Figure  5  Whyte’s  Outcome 
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Model  (adapted  from  Whyte 


outcome,  it  is  thought  that  sensitivity  to  treatm  taterest.,J,  l*°  When  levels  of 

treatment  effects  have  a  direct  impact  on  the  outc  outcome  are  above  or  below 

intervention  and  the  corresponding  levels  of  measure^  ^  ^ 
the  main  diagonal  cells,  there  ,s  often  a  tra  e-  ^  of  intervening 

(ability  to  detect  change)  and  clinical  re  evance  may  not 

variables.139  Because  the  level  of  treatment  and  level  ^  outcome  gold 

always  coincide.  Whyte's  framework  may  e  p  the  levels  of 

-r=: 

the  outcome  measurement. 


30 


to  diagnose  the  target  disor  -  reflect  pathology  and  impairment  levels 

Of  disablement  •«>  ^  with  radiculopathy,  EMG  abnormal,  ttes 

status  has  occurred.  ,  , n;t.  relatively  minimal 

may  be  observed  years  after  the  initial  onset  of: symptoms  ^ 

symptoms  and  disability ’“4  The  following  non- 

with  CIS:  although  marked  improvement  ,s  o  served  n  P 

surgical  and  surgical  intervention.  NCS  may  show  no  ,mpr  v  ^ ^ 

,  122.123.127,137.138  Tfleref0re,  a  change  in  the  patient 

pathology /impairment  level  measurement)  are  used  as  an  otheo^  ^  rf  ^ 

’thCRandCTS.  Although  treatment  will  no 

this  study  is  primarily  at  the  functional  limitation  and  disability  eve 

— — - — — 

have  been  shown  to  be  valuable  measures  of  pauent  ^  ^  ^  A  „umber 

conceptual,  methodological,  practical,  and  atutu  m  e5tablished 

psychometric  proper,, es  an  often  ^  ^  purp0se5  to  compare 

society  « 141  Unfortunately,  most  HS AM  change  score 

groups  of  patients  and  what  consulates  a  meamn^  ^  indicators  of  outcome 

for  a  given  instrument  is  largdyu^  ^  owm  with  treatment  of  a 

may  be  m  familiar  to  many  interested  parttes  involved  ,n  the 

given  condition.  T  ftenuently  provide  little  or  no  useful 

management  of  health  care  delivery  but ,  ft  for  guiding 

information  about  the  status  of  individual  patients  and  are 

.  .  140 

treatment  decisions. 

:;"“T 

and  function  is  that  no  objective  external  gold  standard 
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addition,  there  is  no  consensus  for  what  the 

change92'145*146  Despite this  station,  se«ra^  ^  ^  change  when  assessing 

patient’s  or  clinician  s  ratings  as  g  92j43.i46  Qi0bal  perceived 

the  responsiveness  of  health  status  assessment ^  functional  status,  and 

effect  is  an  outcome  measure  for  improvement  i«  jaeschke  has 

- — s  or — - 
described  a  1  ^  ^  ^  ranges  ftom  _7  C'a  very  great  deal  worse  )  to 

perception  of  improv  better).  Intermittent  descriptors  of 

“»  (“about  the  same  )  to  (•  *»  *  ^  descliptors  of 

worsening  are  assigned  values  from  -1  to  ^  , 

improvement  are  assigned  values  from  +1  to  +6. 

The  complete  list  of  descriptors  with  the  corresponding  values  is  as  follows, 

/n\  A  verv  sreat  deal  better  (+7) 

A  very  great  deal  worse  (-7)  About  the  same  (0)  A  great  deal  better  (+«> 

A  great  deal  worse  (-6)  Quite  a  bit  better  j 

Quite  a  bit  worse  (-5)  Moderately  better  ^  ' 

Moderately  worse  H)  Somewhat  better  t 

Somewhat  worse  (*3)  A  little  bit  better  t 

A  little  bit  worse  (-2)  A  tiny  bit  better  v 

A  tiny  bit  worse  (almost  the  same) 


A  very  great  deal  worse  (-7)  About  the  same  W  A  great  deal  better  (+6) 

A  great  deal  worse  (-6)  Quite  a  bit  better  j 

Quite  a  bit  worse  (-5)  Moderately  better  t 

Moderately  worse  H)  Somewhat  better  t 

Somewhat  worse  (*2)  A  little  bit  better  v 

A  little  bit  worse  (-2)  A  tiny  bit  better  v 

A  tiny  bit  worse  (-1)  (almost  the  same) 

(almost  the  same) 

The  use  of  retrospective  GSRS  as  a  gold  standatd  ~ 

-  "  -  -  *  ^nutations,  the  use  of  a 

status  may  be  inaccurate  or  .ase  standard  ents  a  credlble 

option  in  the  absence  of  an  external  go  ^ 

feasible,  and  useful  method  for  assessing  outcome  ^ 

TWs  study  will  measure  two  outcome  — ^ ^ ^  at  6  weehs 

surgical  or  non-surgical,  a  patient  r  This  dichotomoUS  grouping  (surgicaUnon- 

from  the  time  of  enrollment  in  t  e  stu  y.  concerns  about  morbidity 

and  timeliness  of  intewention.  If  a  patien 
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condition  with  non-surgical  trea tment,  ■«  may  be  ^  requires 

complications  associated  wit  surgery.  ...  y  be  experienced 

surgery  in  order  to  obtain  symptom  relief,  less  cost  and debilitation  my  ^  ^  ^ 

by  Offering  the  patient  a  surgical  option  ^  ^  ^  rating  of 

patient  perception  of  improvement  using  a  GS  P  reaSons: 

improvement  is  the  optimum  outcome  variable  of  choice  m  dn  s~  * 

1  „  captures  meamngftil  information  representmg  Severn,  constructs  th  ^ 

for  the  patient 146  2.  Measures  of  neural  and  clirnca  impai 
concern  for  the  pane  ,  The  MCID  of  region  and  disease  specific 

relatively  unresponsive  to  change, 

HSAM  for  CR  and  CTS  is  unknown. 

1 4  Cnl«4l  Appr»i»l  «fC1i.ical  Euminaiio"  V-n— '  ■I“1  Vi  rv"-" 

,:c.— ».  -T'  — 

.  m  Hunter  from  the  clinical  examination  or  patient  sen  rep 
An  item  or  an  item  cluster  me  However,  most 

may  be  useful  for  diagnosis  of  condition  and  P" 'nls  of  the  clinical 

studies  that  have  assessed  the  diagnostic  test  prope  diagnostic  test 

examination  and  patient  sel, report  instruments  con  am  —  ofrr,  ,  ^ 

methodological  research  principles.  Therefore  g  ^  ^  A 

self-report  instruments  is  discussed  below. 

are  used  when  evaluating  patients  with 

Several  common  clinical  — . on  measures  ar  ^  ^  ^  ^  Md 

established,  inter-rater  reliability  is  a  pre-requisite  for  establishing  A 

accuracy  and  predictive  validity  of  any  examination  proce  urn  u 

•4  ■  c,™™  24 pp.33-i4  The  inter-rater  reliability,  diagnostic 

impairment  or  aid  in  diag  „„,v  used  clinical  examination  measures 

and  predictive  validity  of  the  following  commonly  used 
will  be  evaluated  in  this  study 
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Qusaims:  1" majonty  of  disease  states, 

diagnosis  can  be  made  from  informal  obtamed  location  and 

time38'10  Although  not  common,  a  quest, on  regar  mg  symp  ,  „  ^ 

behavior,  and  history  can  be  used  and  measured  as  ^  ^ 

diagnostic  test  questions  must  be  deveioped  usmg  the  — hought  t0  be  more 
methodology  and  metres  apphed  to  other  d.agnost.c  p  j7  test 

objective,  typically  laboratory  tests  or  chmcal  eaamma  ton  3»»and 

questions  are  often  the  most  powerftil  diagnostic  measure  ? 

can  possess  sensitivity  and  specificity  values  of  1.0  m  so 

t,  ,  t  tn  ue  0f  diagnostic  value  when 

examimng  patterns  vnt  su  p  asses5  lhe  diagnostic  properties  of 

formally  or  only  lirnitedly  assessed.  This  y  s  fluently  reported  by 

the  questions  and  responses  listed  below  w  c  a  reliability  of  these 

patients  with  CR  and  CTS.  form,  which  u  contacted 

items  are  available.  All  1 1  questions  are  listed 

in  appendix  C.  hnthersome  for  you'7 

1.  Which  of  the  following  symptoms  are  most  bothersom 

Pain 

Numbness  &  Tingling 


r>  r* _ Km/v 


2.  Where  are  your  symptoms  most  bother  some’ 
Neck 

Shoulder  or  shoulder  blade 
Arm  above  elbow 
Axm  below  elbow 
Hands  and/or  fingers 


wh,ch  of  the  following  best  describes  the  behavior  of  your  symptoms^ 
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Constant 

Intermittent 

Variable  (comes  and  goes) 


4.  Does  your  affected  hand  fee!  “fa.”  or  “swollen- 

trouble  with  fumbling  or  dropping  objects  from  your  affected 


5.  Do  you  have 
hand? 


affected  limb  and/or  hand  feel  numb . 
from  falling  asleep? 


6.  Does  your  entire 

7.  Do  your  symptoms  keep  you 

,  wake  you  during  the  night? 

;  with  moving  or  positioning  your  neck- 

‘shaking”,  or  positioning  your 


8.  Do  your  symptoms 

9.  Do  your  symptoms  improve 


10.  Do  your  symptoms  improve  with  movi  g, 

wrist  or  hands? 

1 1  Are  your  symptoms  brought  on  or  mad 
require  a  lot  of  grasping  or  finger  movement? 


when  performing  tasks  that 


■  ,i™  rfthef tpper  Extremity  This  examination 

QfflysmionMJ^  ”  ifief/MSRkMnd  sensation  testing.  ^ 

S'retCh  "  :"indicated  for  patterns  who 
standard  neurological  examination  ofthe  upper  ^  ^  ,0  viikan- 

present  with  radiating  neck  pan.  and  symptom^  ^  ^  ^  ^  (Kappa  .40 
Juntura  found  moderate  m.er-rater  re  , a  i  estprocedures.  Although  the 

64)'54  using  standardized  and  operat.ona  y  q  n(  in  the  evaluation  of 

conventional  neurologic  examination  is  a  stan  ot'CR  and  CTS  has  not 

patients  with  suspected  CR  and  CTS,  its  value  for  the  diagn 
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ur  KpH  21  The  reliability  and  validity  of  conventional  neurolog’ 
been  well  established.  j^e  reliability 

.  ftLP,innpr  extremity  are  summarized  in  1  aDie  . 

examination  procedures  of  PP  Hi  ires  specific  for  patients 

and  validity  of  conventional  neurological  examination  procedures  P 
with  suspected  CTS  are  summanzed  in  Table  2. 


vO 

cn 


<D 

> 

<u 


'~a 

V- 

0 

o 

4__l 

C 

c 

■5 

'5 

CL 

Q- 

rN 

• — ■ 

c 

c 

0 

0 

*0 

-a 

<u 

<d 

cn 

on 

ca 

C3 

cn 

C£ 

r- 

m 


(N  O 

^  o  o 
<u 

00  1  1 
C  o  — 

CO 


a,s.  si  ? 

c  .  ;  © 

CO  ^  0) 

*=  S  “ 

•  <u 
II  II  u 

C  Q.& 

in  m 


-a  ^  id 

.  |jj  y  g 

■~1>  2  —  .2 

CO  >  D  tO 

a  u  ^  >  w 

*  g  -&JS  -5 

o  c  —  '  to 

■B  5  w  ^  a. 

c  cO  ^  V-  P- 

£  ^  30h  j 

i  m  C  O  to 

O  Ji  £  <l> 

3  •£  •£  ca  .5 

£  <*■  ’?  O-  l/l 

CD  o  >  w  a> 

•.=  a  -o  »  s 

o  8  3  -s  e- 

*-  u  «  9. 
u  co  <d  J2  -C 
«2  w  “  c  ^ 
O  .i  C  8  2 

d)  ^  "O  >  o 
u  S  «  C  = 

*-*  CO  <*-  •  — 

c  a)  •-  a - 

•S  c-  S  ^  <u 

£  a:  12  §  3 


to  ^ 

co  a  g 

c+-  £  C  —  T3 
O  *-=  §  <u  o 
to  ’o  ,  > 

U  O.J2  _£  3 
o  T3  —  *o 

C  CO  CO  cj  4J 

*>  C  C  4>  »- 
u.  a>  c  b: 

,<L>  c  ^  -O  — 

6  O  ’o  H  T3 

^  cl*£  •  ^ 

«  °  -O  §  s 

^  t-  _  —  e 

*tn  O  cO  .ti  3 

QJ  .£  <U  *3  -o 
-o  >  ts  §  CD 

g  5  s  y 

C  *o  -Q  -—  -o 
O  CO  <*r  2  u 
3  O  qj  l- 

Ts  .2  c  g  -r 

CO  =  o  «  CO 

co  -Q  •-  oo  c 

-O  CL  CO  -O  £ 

>*  u  c  c  ° 

=  O  ’£  «  c 

CO  O  E  «  r, 

S3  U  5?t3 

-t3  n  <u 

£  «  Q  o  a 


38 


to-  r prviral  ranee-of-motion  (ROM) 

^ange^f^otion_andWnstDianieteLMeasuremeDL  ,«  and 

is  frequently  assessed  when  examining  patients  wit  comp  effectiveness  or  as 

cervical  ROM  measurements  may  be  used  as  an  tndtcator  of  «-*» . «®  . 

io>  Cervical  ROM  is  often  impaired  and  m  y 

a  measure  of  patient  outcome.  Cerv.ca  ,  aired  cervical 

functional  limitations  in  patients  with  cervical  ra  tcu  opa  y  ,67 

flexion  ROM  has  been  described  as  being  usefrtl  for  the  dtagnos  rfCR-  . 

schools  of  manual  therapy  hypothec  that  ceriarn  patterns  of  rt*  ome 

are  indicative  or  pathognomonic  of  a  particular  underlying 
but  no  data  have  been  collected  to  support  this  hypothesis. 

The  cervical  ROM  measures  obtained  in  this  study  include.  M  .  92 

side-bending,  and  bilateral  rotation.  Intra-class  correlation  coe  ^ 

uxmn^gjyfljesTeporting  the  reliability  of  cervical  range-of-motion  measurements 
w,th  a  bubble  goniometer  are  limited.  Lowery  reported  a  Pearson  r  of  .54  for  «*« 

reliability  of  a  cervical  impairment  measurement tahe  whh  a  "I—  of 
and  co, leagues  reported  the  inter-rater  reliability  for  , 

cesvicalftWextension, 

uf^.^andSUrespecttvel,  ~fr  all  measurement  methods.  The  bubble 

The  Wrist-mtio  index  ,s  a  proportion  derived 

(numerator)  and  medial-lateral  (denominator)  wnst  wt  m  more  «square” 

crease  in  centimeters.  Patients  with  a  larger  wrist-ratio  are  ■  «  ^ 

shaped  wrist  which  is  presumed  to  result  in  dimimshed  carpal  c  „ 

said  to  be  predictive  of  CTS,  ,  ,  .  a  ratio  >.70  and 

,983  and  reporied  a  strong  positive  COT*‘“"  has  been  observed 

prolonged  median  distal  sensory  latency.  This  posit 
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U  u  „f  other  authors  including  one  large  prospective  study  of  665 

by  a  number  of  other  au  ,rTS  m  Ina  sample  of  228  patients,  Kuhlman 

consecutive  presenting  for  evaluation  of  CTS.  In  a  samp  for 

and  Hennessy  reported  sensitivity  and  spectfictty  values  o  an  .  ^  ^ 

.  .  .  •  .  when  a  ratio  of  >  70  was  used  to  define  a  positive  test.  E 

::ii — «...  — - «,s  • 

76  175  In  the  one  study  that  reported  no  correlation  between  t  e  wns 

NCS,  the  index  was  computed  from  measurements  taken  at  the  proximo  wnsttfease  of 

asymptomatic  rai.-road  mamtenance  workers.  The  extreme,  ^ 

used  to  establish  the  diagnosis  of  CTS  in  this  study  were  inadequate  on 

s  of  CTS  which  would  be  expected  to  exist  in  an  asymptomatic  population  (d, 
cases  of  CTS,  which  wo  v  index  appears  to  be 

mntnr  and  sensory  latencies  of  >4.4ms).  Althougntnew 

i  rTS  the  reliability  and  measurement 
useful  for  evaluating  patients  with  suspected  CTS,  the 

precision  of  this  clinical  measure  is  not  known. 

Proyoc3£ioilJS5ts:  Provocation  tests  are  procedures 

patient’s  symptoms  and  usually  have  a  dichotomous  outcom  .  P  ^  ^ 

indicate  that  the  target  disorder  has  a  mechanical  componen  ^  ^  ^  ^  is 

i68pp.94. i69.i 77pp. 75-88  basis  for  most  of  the  prov 

mechanical  deformation  (compression  or  tension)  or  alleviation  of 

‘HI^  dlLauon  (distraction  or  relaxation)  of  the  neural  element i— 

decreases,  respectively,  symptoms  or  seven,  of  symptoms  inpatients^  ^  ^ 

CTS.178-181  Mechanical  deformtdon  resuhs °^ose  fjring  threshold  is  elevated 
symptoms  due  to  ischemia  and  irritation  ot  nerve 

.  .  179.181 

due  to  injury. 

provocative  tests  for  patients  with  CR  may  induce  or  alleviate  mechanical  deflation  by 
the  folloM„g  mechamsmsi  element  «  “  ,ntrathecal 

peripheral  n^mlelemem;  ^  wUh  CTS  employ  external 
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,  .  .  88.178.179  utiiize  direct  percussion  of 

already  elevated  pressure  within  the  carpal  canal  ^ 

the  nerve  to  excite  hyperirritable  or  regenerating  axons. 

Each  provocative  test  assessed  in  this  study  is  described  below  along  with  the  operattonai 
definition  of  each  test  as  it  will  be  administered  in  this  study. 

,  Neek  Compression  Tests:  Originally  described  by  Spurling  and  Scoville  as  a  test  for 
cervical  radiculopathy,  .n  thetr  description,  the  patient,  head  is  laterally  flexed  towar  s 

the  side  of  pain,  and  a  compression  force  of  1 5  lbs.  is  applie  tolt  °  ^ 

positive  test  is  defined  as  the  reproduction  of  the  characteristic  ra  icu 

authors  have  modified  this  test  and  incorporate  neck  extenston  and  rotation  towar 

side  of  pain  prior  to  applying  a  compression  force  of  15  lbs  tote 

Both  tes,  procedures  wil,  be  used  in  this  study  and  will  be  appli^h  thepa^  *  » 
sitting  position. 

designated  Method  A  and  the  mod  Following  the  application  and  grading 

performed  first  and  graded  as  positive  or  negat 

of  Method  A.  Method  B  will  then  be  performed  and  graded  in  the  same  manner. 

,  Shoulder  Abduction  Test:  The  shoulder  abduction  test  is  p" 
complaints  of  radiating  neck  pain  or  radicular  signs  and  symp  <®  '  ^ 

mechanism  of  pain  relief  is  unclear,  it  is  thought  to  be  due  to  dimmis  e  ens, 
irritable  nerve  root'8*  A  positive  test  is  defined  as  the  elimination  of  or 

symptoms 

The  Shoulder  abduction  test  is  shown  in  Figure  6.  While  sitting,  the  — 

t0  piace  the  hand  of  the  affected  extremity  on  the  head  in  order  to  support 
the  scapular  plane."4  The  test  will  be  graded  positive  or  negative 
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Figure  6 


3.  Valsalva  Maneuver:  The  Valsalva  maneuver  is  designed  to  detect  a  space-occupying 
lesion  in  the  cervical  spine,  such  as  a  herniated  disk  or  an  osteophyte.  A  positive  test  is 

.  r  ,  150pp.  123-1 27 

defined  as  the  reproduction  or  exacerbation  of  symptoms. 

While  sitting,  the  patient  is  instructed  to  take  a  deep  breath  and  hold  the  breath  while 
attempting  to  exhale  over  a  2-3  second  period  with  gradually  increasing  force.  This  test 
has  been  modified  to  include  the  gradual  force  build-up  period.  Because  of  associated 
morbidity,  the  Valsalva  maneuver  will  not  be  performed  by  patients  in  this  study  who 
have  cardiac  disorders  and  patients  with  ophthalmic  disorders  other  than  visual  acuity 
deficiencies. 192'194  The  test  will  be  graded  positive  or  negative. 

4.  Distraction  Test:  The  neck  distraction  test  is  performed  on  patients  with  complaints  of 
radiating  neck  pain  or  radicular  signs  and  symptoms.150  A  positive  test  is  defined  as  the 
elimination  of  or  decreased  symptoms.  If  positive,  a  cervical  disc  herniation  is  suspected 
and  indicates  the  potential  for  mechanical  traction  to  be  an  effective  treatment  approach. 
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With  the  patient  lying  supine  and  the  neck  comfortably  positioned,  the  rater  will  securely 
grasp  the  patient's  head  under  the  occiput  and  chin.  An  axial  traction  force,  not  to  exceed 
~30  lbs.,  will  then  be  manually  applied  to  the  neck.154  The  test  will  be  graded  positive  or 

negative. 

5.  Upper  Limb  Tension  Tests  (ULTT).  The  ULTT,  or  ‘brachial  plexus  tension  test’  was 
originally  described  by  Elvey  in  1979.195185pp  577-585  Several  modifications  of  Elvey's  test 
designed  to  selectively  stress  the  peripheral  nerves  of  the  upper  extremity  have  since  been 
proposed.  Two  basic  ULTT  procedures  will  be  used  in  this  study  and  are  purported  to 

*  1  177pp.  147-1 53 

emphasize  tension  in  the  median  and  radial  nerve,  respectively. 

ULTT  A:  With  the  patient  supine  and  the  cervical  spine  in  neutral,  the  following 
motions  will  be  sequentially  applied  to  the  symptomatic  upper  extremity  and  are 
illustrated  in  Fig  7  on  the  following  page:  1)  scapular  depression  (A),  2)  gleno-humeral 
abduction  (B),  3)  forearm  supination,  wrist  and  finger  extension  (C),  4)  shoulder  external 
rotation  (D),  5)  elbow  extension  (E),  and  6)  contralateral  then  ipsilateral  cervical  lateral 
flexion  (F).  The  patient  is  questioned  regarding  symptom  reproduction  throughout  the 
maneuver.  If  symptoms  are  not  reproduced  during  testing  of  the  symptomatic  limb,  the 
test  will  then  be  applied  to  the  opposite  limb  in  an  identical  manner  in  order  to  compare 
elbow  extension  range-of-motion  between  limbs.  195’185pp'577‘585  The  test  is  considered 
positive  in  this  study  if  the  following  conditions  are  met:  1)  the  test  reproduces  any 
portion  of  the  patient’s  symptoms  or  pain  complaints,  2)  there  are  side-to-side  differences 
in  elbow  extension  when  all  previous  motion  sequences  have  been  completed,  and  3)  for 
the  symptomatic  limb,  contralateral  neck  lateral  flexion  increases  symptoms  or  ipsilateral 
lateral  neck  flexion  decreases  symptoms.  When  a  positive  result  occurs,  the  examiner 
will  note  and  record  the  element  in  the  range-of-motion  test  sequence  (1-6)  that  elicited 
the  positive  result.  The  test  is  concluded  when  a  positive  result  is  obtained  or  when  all 
motion  sequences  have  been  completed. 

ULTT  B:  With  the  patient  supine,  shoulder  abducted  to  30°,  and  the  cervical  spine  in 
neutral,  the  following  motions  will  be  sequentially  applied  to  the  symptomatic  upper 
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extremity  and  are  illustrated  in  Fig  8:  1)  scapular  depression  (A),  2)  medial  shoulder 
rotation  (A),  3)  full  elbow  extension  (“locked”)  (B).  4)  wrist  and  finger  flexion  (C),  and  5) 
contralateral  then  ipsilateral  cervical  lateral  flexion  (as  for  ULTT  A).  The  patient  is 
questioned  regarding  symptom  reproduction  throughout  maneuver.  If  symptoms  are  not 
reproduced  during  testing  of  the  symptomatic  limb,  the  test  will  then  be  applied  to  the 
opposite  limb  in  an  identical  manner  in  order  to  compare  wrist  flexion  range-of-motion 
between  limbs.  The  test  is  considered  positive  if  either  of  the  following  conditions  are 
met:  1)  the  test  reproduces  any  portion  of  the  patient’s  symptoms  or  pain  complaints.  2) 
there  are  side-to-side  differences  in  wrist  flexion  when  all  previous  motion  sequences 
have  been  completed,  and  3)  for  the  symptomatic  limb,  contralateral  neck  lateral  flexion 
increases  symptoms  or  ipsilateral  lateral  neck  flexion  decreases  symptoms.  When  a 
positive  result  occurs,  the  examiner  will  note  and  record  the  element  in  the  range-of- 
motion  test  sequence  (1-6)  that  elicited  the  positive  result.  The  test  is  concluded  when 
a  positive  result  is  obtained  or  when  all  motion  sequences  have  been  completed 


>  ‘  h:  v  -v  m 
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Figure  7  (Adapted  from  Butler177) 


44 


Figure  8  (Adapted  from  Butler177) 


6.  Tinel’s  Sign:  Two  methods,  classic  (Method  A)  and  provocative  (Method  B),  will  be 
performed.  Both  methods  are  reported  to  localize  the  level  of  a  peripheral  nerve  injury 
by  performing  percussion  directly  over  the  nerve  suspected  to  be  involved.196  The  test 
was  originally  described  as  a  method  of  detecting  and  monitoring  nerve  regeneration  after 
laceration.188  Others  have  described  or  applied  the  test  as  a  provocative  measure  in  order 
to  reproduce  the  patient’s  symptoms.16  In  compression  injuries  such  as  carpal  tunnel 
syndrome,  regeneration  will  only  occur  after  the  syndrome  has  progressed  to  the  point  of 
nerve  degeneration.196  In  the  early  stages  of  carpal  tunnel  syndrome,  or  in  advanced 
cases  in  which  the  degeneration/regeneration  process  has  reached  a  steady  state,  Tinel’s 
Sign  may  be  negative,  even  though  the  syndrome  is  present.  In  the  case  of  suspected 
carpal  tunnel  syndrome,  the  median  nerve  is  percussed  over  the  carpal  tunnel.150 
Method  A-  With  the  patient  sitting,  elbow  flexed  0-30°,  and  the  forearm  in  a  supinated 
position,  the  patient's  wrist  and  hand  will  be  supported  in  a  neutral  position.  A  tendon 
reflex  hammer  positioned  ~6  in.  above  the  wrist  will  be  allowed  to  fall  4-6  times  over  the 
median  nerve  located  between  the  tendons  of  the  flexor  carpi  radialis  and  the  palmaris 
longus  at  the  proximal  wrist  crease.  A  positive  sign  is  considered  to  be  present  when  the 
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patient  reports  a  non-painful  tingling  sensation  radiating  distally  along  the  course  of  the 
nerve.18 8,196  The  test  will  be  graded  positive  or  negative 

Method  B-  The  patient  will  be  positioned  as  for  method  A  above.  Using  a  tendon  reflex 
hammer,  the  Rater  will  directly  percuss  the  median  nerve  located  between  the  tendons  of 
the  flexor  carpi  radialis  and  the  palmaris  longus  at  the  proximal  wrist  crease  4-6  times 
with  mild  to  moderate  force.  A  positive  sign  is  considered  to  be  present  when  the  patient 
reports  discomfort  or  pain  at  the  wrist  or  radiating  distally  along  the  course  of  the  nerve 
that  is  related  to  their  condition.  The  test  will  be  graded  positive  or  negative. 

7.  Phalen’s  Test:  Phalen’s  wrist  flexion  test  was  developed  as  a  clinical  test  for  carpal 
tunnel  syndrome.157  Maximal  wrist  flexion  decreases  the  cross-sectional  area  of  the 
carpal  tunnel  and  compresses  the  median  nerve  between  the  flexor  tendons  and  the 
transverse  carpal  ligament.197  A  positive  test  is  defined  as  the  reproduction  or 
exacerbation  of  paresthesias  or  anesthesia  in  the  cutaneous  distribution  of  the  median 
nerve  in  the  hand.157 

While  sitting,  the  patient's  elbow  will  be  flexed  0-30°  and  the  forearm  and  wrist  will  be 
supported  by  the  Rater  in  a  pronated  and  neutral  position,  respectively.  The  patient's 
wrist  will  then  be  placed  in  a  position  of  maximal  flexion  for  a  maximum  of  sixty- 
seconds.157  For  this  study,  the  patient  will  be  questioned  with  regard  to  symptoms  at  15 
second  intervals  during  the  sixty-second  period.198  The  test  will  be  graded  positive  or 
negative.  The  test  is  concluded  when  a  positive  test  result  is  obtained  or  at  the  end  of  the 
maximum  sixty-second  time  period. 

8.  Carpal  Compression  Test  (CCT):  The  CCT  was  originally  described  by  Durkan186  as 
a  clinical  diagnostic  test  for  carpal  tunnel  syndrome.  The  test  is  considered  positive  if  the 
patient’s  symptoms  in  the  cutaneous  distribution  of  the  median  nerve  are  reproduced. 

While  sitting,  the  patient's  elbow  will  be  flexed  0-30°  and  the  forearm  and  wrist  will  be 
supported  in  a  supinated  and  neutral  position,  respectively.  Placing  both  both  thumbs 
over  the  transverse  carpal  ligament,  the  rater  will  then  apply  a  approximately  6  pounds  of 
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I  0£ 

pressure  with  each  thumb.  The  pressure  is  maintained  for  a  maximum  of  30  seconds. 

For  this  study,  the  patient  will  be  questioned  with  regard  to  symptoms  at  15  second 
intervals  during  the  thirty-second  period.  The  test  will  be  graded  positive  or  negative  and 
is  concluded  when  a  positive  test  result  is  obtained  or  at  the  end  of  the  maximum  thirty- 
second  time  period. 

The  diagnostic  properties  and  reliability  coefficients  of  each  provocative  test,  if  known, 
are  reported  in  Tables  3  and  4.  Unlike  tests  of  CR,  a  large  number  of  studies  of  CTS 
have  reported  a  wide  range  of  diagnostic  accuracy  values  for  Phalen’s  test,  Tinel’s  Sign, 
and  the  CCT  (Table  4).  One  explanation  for  the  variabity  of  these  findings  is  errors  in 
diagnostic  test  methodology.  A  summary  of  CTS  provocative  test  studies  and  associated 
methodology  is  listed  Figures  9  and  10 


Table  3 


Reported  Reliability  and  Validity  Coefficients  for  CR  Provocative  Tests 

TEST  PROCEDURE 

RELIABILITY 

VALIDITY 

Spurling’s21 154 

(range) 

Gold  Standard: 

iT 

i 

VO 

II 

U 

Myelography 

Sn=  .36  Sp=  .96 

(range) 

Gold  Standard:  Myelography 

Shoulder  Adbuction  Test21  154 184 

O 

Cn) 

11 

Sn=  .43  Sp=  .80 

%  agreement=  68% 

Valsalva  Maneuver 

Has  not  been  reported 

Has  not  been  reported 

Neck  Distraction21 154 

o 

to 

II 

Gold  Standard:  Myelography 

Sn=  .40  Sp=  1.0 

ULTT  (may  also  used  with  CTS 
patients)154 

K=  .35 

Has  Not  Been  Reported 
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Table  4 


Reported  Reliability  and  Validity  Coefficients  for  CTS  ProvocationTests 


TEST  PROCEDURE 

Phalen’s16'157'160'162"164186'199'213 

RELIABILITYS 

VALIDITY# 

Intrarater  K=  .53 

*Sn=  .48  -  .88 

Interrater  K=  .63 

Sp=  .32  -  .90 

Tine!  ’s 1 6- 1 57* 1  ^ 1 62* 1 64  ■ 1 86- 1  "*20 1,203-2 1 4 

Intrarater  K=  .80 

Sn=  .25  -  .74 

Interrater  K=  .79 

Sp=  .59  -  .97 

££  jl  60. 1 86. 1 99-20 1 ,209.2 1 5.2 1 6 

Has  not  been  reported 

Sn=.21  -  .89 

213.217 

+Sp=  .33  -  .96 

#Gold  standard  is  NCS  or  NCS  &  compatible  CTS  symptoms  in  almost  all  cases  (see  Table  6  for  detail) 
SReliability  coefficients  come  from  a  single  study212 
*Extreme  value  of .  1 1 164  has  been  reported 
+Extreme  value  of  .08160  has  been  reported 
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TEST/.SYwc/v 

METHODOLOGICAL  PRINCIPLE 

Phalen’s  and  Tinel’s 

Clear 

Operational 

Definition? 

Gold  Standard 

Spectrum? 

Bias? 

Borg  et.al 163 

P=No:  T=No 

NCS 

3;  a 

2*,3* 

De  Krom  et.  al 160 

P=Yes;  T-Yes 

NCS/CTS  Sx’s 

U;a 

None 

De  Smet  L  et.  al. 199 

P=Yes;  T=No 

NCS 

U;  a  &  b 

2*,3* 

Durban  JA lH 6 

P=Yes;  T=N 

NCS 

3;  b 

2*,  3* 

Gellman  H et.  al.20'* 

P=Yes:  T=N 

NCS/EMG 

2;  b 

1,2*,3* 

Golding  DN  et.  al. 164 

P=No;  T=No 

NCS 

3;  c 

2*,3* 

Gonzalez  J  et.  al 200 

P=Yes;T=Yes 

Clinical  exam 

abnormalities 

and  surgical 

relief 

2;  b 

2*,3* 

Heller  L  et.alm 

P=Yes;  T=No 

NCS/EMG 

U;  c 

2*,3* 

Katz  JN  et.  al.16 

Kuschner  SH et.  al.206 

P=Yes:  T=Yes 

NCS/EMG 

U;  c 

None 

Moss  man  SS  et.  al.201 

P=No;  T=Yes 

NCS/EMG 

3;  unknown 

1,2*,3* 

++Phalen  GS'57 

P=Yes;  T=No 

Clinical  opinion 

3;c 

1,2,4 

Novak  CB  et.  al.2" 

Rietz  KA  et.  al2m 

P=Yes:  T=Yes 

Clinical  Sx’s 

U;  c 

2*,3* 

Stewart  JD  et.  al.  (Tinel 's 

only)207 

No 

NCS/Clincal 

Sx’s 

U;  b 

2*,3* 

Seror  P  (Phalen 's  only  )202 

Yes 

NCS 

U;  b 

2*,3* 

Seror  P  (Tinel ’s  only)2'8 

No 

NCS/EMG 

2 

2*, 3* 

Szabo  et.  al.2" 

P=Yes;  T=  No 

Surgical  relief 

of  symptoms 

2;  a,&  b 

2*,3*,4 

Tetro  MA2W 

P=Yes;  T=No 

NCS/EMG 

U;  b 

2*,3* 

Williams  TM2W 

Gelmers  HG  (Tinel ’s  only)2" 

P=Yes:  T=No 

Clinical  Sx’s 

U;  b 

2* 

P=  Phalerf  s 
T=  Tinel’s 

*=  Report  did  not  exclude  possibility 
U=  Unknown 


■=  Retrospective 


Figure  9  (cont’d) 


Types  of  Bias: 


Spectrum: 


1=  Work-up  Bias 
2=  Diagnostic-Review 
3=  Test-Review 
4=  Incorporation 

Target  Condition  Severity  (stated  or  per  NCS/EMG  findings)- 
1=  Mild/moderate 
2=  Severe 
3=  1  &  2  above 
Control  Group 

a=Other  or  competing  conditions,  similar  symptoms 

b=  Asymptomatic.  “normal”  subjects 

c=None 


Figure  9:  Study  Methodology  for  Phalen’s  Test  (P)  and  Tinel’s  Sign  (T) 


TEST /Study 

METHODOLOGICAL  PRINCIPLE 

CCT 

Clear 

Operational 

Definition? 

Gold  Standard 

Spectrum? 

Bias? 

Durban  JA lH6 

Yes 

NCS 

3;  b 

2*, 3* 

Durban  JA2h 

Yes 

NCS/CTS  Sx's 

3;  b 

2*,3* 

Gonzales  J  et.  al 200 

Yes 

Clinical  exam 

abnormalities 

and  surgical 

relief 

2;  b 

2*,3* 

DeKromMCet.  al160 

Yes 

NCS/CTS  Sx’s 

U:  a 

None 

DeSmit  Let.  alm 

No 

NCS 

U,a  &  b 

2*, 3* 

Tetro  MA  et.  alm 

Yes 

NCS 

U;b 

■  2*, 3* 

Mossman 201 

No 

NCS/EMG 

3;  unknown 

1,2*, 3* 

Szabo  et.  al2h 

Yes 

Surgical  relief 

of  symptoms 

2 

2*, 3*, 4 

Wainner  RS  et.  al 216 

Yes 

NCS 

3.c 

2 

*=  Report  did  not  exclude  possibility 
U= Unknown 

Types  of  Bias:  1  =  Work-up  Bias 

2=  Diagnostic-Review 
3=  Test- Review 
4=  Incorporation 

Spectrum:  Target  Condition  Severity - 

1=  Mild/moderate 
2=  Severe 
3=  1  &  2  above 
Control  Group 

another  or  competing  conditions,  similar  symptoms 

b= Asymptomatic,  “normal”  subjects 

c=None 


Figure  10:  Study  Methodology  for  CCT 
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There  is  a  dearth  of  research  related  to  the  efficacy  of  provocative  tests  for  the  diagnosis 
of  CR.  In  contrast,  a  great  deal  of  literature  has  been  published  regarding  the  efficacy  of 
provocative  tests  for  the  diagnosis  of  CTS  but  due  to  the  range  of  findings,  few 
conclusions  can  be  drawn.  It  is  clear  from  the  available  evidence  that  the  reliability  and 
validity  of  commonly  used  provocative  tests  for  the  diagnosis  of  CR  and  CTS  is  not  well 
established. 

2.4,2  Common  Patient  Self-Report  Measures 

All  but  two  of  the  patient  self-report  instruments  assessed  in  this  study  are  HSAM. 

Health  status  assessment  measures  can  be  used  for  three  broad  purposes  which  have  been 
described  as  discriminative,  predictive,  and  evaluative.142  A  discriminative  instrument  is 
used  to  distinguish  between  individuals  or  groups  based  on  an  underlying  dimension 
without  reference  to  a  gold  standard.  A  predictive  instrument  is  used  to  classify 
individuals  into  distinct  categories  based  on  comparison  with  a  gold  standard  in  order  to 
identify  individuals  who  have  or  will  develop  a  target  condition  or  outcome.  An 
evaluative  instrument  is  used  to  assess  clinically  meaningful  change  over  time142 

An  HSAM  that  demonstrated  excellent  psychometric  properties  for  all  three  purposes 
mentioned  above  would  be  ideal.  However,  properties  of  an  instrument  that  maximize 
one  of  the  previously  mentioned  three  purposes  is  likely  to  limit  the  ability  of  the 
instrument  to  fulfill  the  other  two  purposes  well.142  Several  psychometric  properties  are 
essential  before  a  HSAM  or  any  patient  self-report  instrument  can  be  meaningfully  used 
for  the  patient  management  and  include:  reliability,  validity,  internal  consistency,  and 
responsiveness  to  clinical  change.219  The  psychometric  properties  of  the  all  the  patient 
self-report  instruments  included  in  this  study  are  acceptable  and  are  discussed  below.220' 

223 


Clinicians  have  been  reluctant  to  incorporate  valid  patient  self-report  instruments,  in 
particular  HSAM,  into  clinical  practice  despite  the  fact  that  they  have  been  available  for 
the  last  20  years  and  are  often  more  valid,  reliable,  and  responsive  than  the  traditionally 
used  clinical  examination  measures  of  impariment.144  224  A  reason  often  given  for  this 
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reluctance  is  that  they  impose  an  excessive  time  burden  on  both  the  patient  and 
practitioner.144  Therefore,  the  use  of  patient  self-report  instruments  in  patient  care  should 
be  done  in  a  parsimonious  fashion.  If  the  self-report  instruments  used  in  this  study  are 
capable  of  fulfilling  more  than  one  purpose  (discriminative,  predictive,  evaluative),  then 
both  clinician  and  respondent  burden  will  be  eased  and  it  may  facilitate  more  frequent  use 
in  clinical  practice.  Copies  of  all  patient  self-report  instruments  are  located  in 
appendix  B. 


The  Neck  Disability  Index  fNDI):  The  purpose  of  the  NDI  is  to  evaluate  change  over 
time  in  patients  with  neck  pain.  Vernon  and  Mior  developed  the  NDI  by  modifying  the 
Oswestry  Low  Back  Pain  Disability  Index,  which  is  a  region-specific  self-report  measure 
of  disability  for  patients  with  low  back  pain.220  The  authors  first  identified  issues  and 
activities  considered  most  relevant  to  assessing  the  needs  of  patients  with  neck  pain  and 
submitted  them  to  a  group  of  clinical  practitioners  for  review  and  consensus  rating.  The 
resulting  items  were  then  pilot  tested  in  a  group  of  5  patients  with  whiplash  injury.  The 
final  NDI  consists  of  five  items  from  the  original  Oswestry  Index,  two  of  which  were 
revised  considerably,  and  five  new  items.  Seven  of  the  items  are  related  to  activities  of 
daily  living,  two  are  related  to  pain,  and  one  item  addresses  concentration  (ability  to 
read).  The  original  Oswestry  Index  format  was  retained  and  the  terminology  of  the 
response  statements  were  modified  and  made  relevant  for  patients  with  neck  pain.  The 
six  response  statements  are  scaled  from  0  to  5,  with  0  representing  no  involvement  or 
difficulty  and  5  representing  severe  involvement  or  difficulty.  The  total  NDI  score  is 
derived  by  summing  the  ratings  of  all  10  items  so  that  a  score  of  0  represents  good 
function  while  a  score  of  50  represents  poor  function.  Riddle  and  Stratford  successfully 
used  an  alternative  scoring  strategy  that  is  similar  to  that  of  the  Oswestry  and  accounts 
for  items  left  blank  by  respondents.  A  percentage  score  is  obtained  by  dividing  the 

225 

patients  score  by  the  maximum  possible  score  for  the  number  of  items  answered. 
Although  developed  as  an  evaluative  measure  for  patients  with  whiplash  and  chronic 
neck  pain,  the  NDI  has  also  been  evaluated  in  patients  with  a  wide  variety  of  acute  and 

225  226 

chronic  neck  disorders. 
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Vernon  &  Mior  administered  the  NDI  to  17  patients  during  an  initial  visit  and  then  two 
days  later,  prior  to  the  initiation  of  treatment.  Test-retest  reliability  was  reported  as  a 
Pearson  r  of  .89  and  internal  consistency  of  the  instrument  was  good  (Chronbach’s 
alpha=.80).220  Binkley  also  found  the  NDI  to  have  a  high  level  of  test-retest  reliability 
(ICC=.89)  when  administering  the  instrument  3  days  apart  in  a  sample  of  31  patients 
suffering  from  a  variety  of  neck  disorders.226  Construct  validity  of  the  NDI  is  good  and 
has  been  assessed  in  multiple  settings  using  a  variety  of  methods.  Vernon  and  Mior 
found  the  NDI  to  be  moderately  correlated  with  a  pain  VAS  (r=.60)  and  total  scores  from 
the  McGill  Pain  questionnaire  (r=.70).220  In  the  study  by  Riddle  et.  al.,  the  NDI  was 
moderately  correlated  with  clinician  prognosis  ratings  (r=.66)  as  well  as  the  physical 
(PCS;  r=.53)  and  mental  (MCS;  r=.47)  component  summary  scales  of  the  Short-Form 
Health  Survey  (SF-36).225  In  addition,  the  NDI  was  more  responsive  than  the  PCS  and 
MCS  for  detecting  change  in  functional  status  between  patients  with  different  work  status 
due  to  neck  pain  (altered  vs  not  altered).  Binkley  reported  a  minimal  level  of  detectable 
change  for  the  NDI  of  4.2  points.226 

Brigham  and  Women’s  Hospital  Hand  Symptom  Severity  Scale  (SSS)  &  Function  Status 
Scale  fFSSV  The  purpose  of  the  SSS  and  FSS  is  to  evaluate  change  over  time  in  patients 
with  CTS.  The  hand  SSS  and  FSS  were  developed  by  Levine  et.  al.  in  1993  as  condition- 
specific  scales  to  be  used  in  the  evaluation  and  assessment  of  outcome  in  patients  with 
Cts  221  The  SSS  consists  of  1 1  statement  items  related  to  six  domains  said  to  be  critical 
for  the  evaluation  of  CTS.  These  six  domains,  identified  by  a  panel  of  hand  surgeons, 
rheumatologists,  and  patients,  include:  pain;  paresthesia;  numbness;  weakness;  nocturnal 
symptoms;  and  over-all  functional  status.  Each  statement  is  rated  by  the  patient  on  a  1 
point  (mildest)  to  five  point  (most  severe)  Likert  scale.  An  overall  SSS  score  is  obtained 
by  calculating  the  mean  of  the  1 1  individual  items.  A  higher  overall  SSS  score  represents 
more  severe  symptoms  and  lower  scores  milder  symptoms.  The  FSS  consists  of  eight- 
items  related  to  a  variety  of  activities  commonly  performed  by  a  broad  spectrum  of 
patients  (i.e.  young  and  elderly,  workers  inside  and  outside  the  home). 
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Each  activity  is  rated  by  the  patient  on  1  (no  difficulty)  to  5  (cannot  do  at  all)  Likert 
scale.  An  overall  FSS  score  is  obtained  by  calculating  the  mean  of  the  8  individual  items. 

A  higher  overall  FSS  score  represents  greater  disability  and  and  lower  scores  are 
representative  of  less  disability. 

The  psychometric  properties  of  both  the  SSS  and  FSS  are  acceptable.  Levine  et.  al. 
assessed  the  test-retest  reliability  of  both  scales  in  a  sample  of  67  patients  with  confirmed 
CTS.  Each  scale  was  administered  on  two  consecutive  days  and  Pearson  correlation 
coefficient’s  of  .91  and  .93  were  computed  for  the  SSS  and  FSS,  respectively.221  Because 
no  universally  accepted  gold  standard  exists  for  measuring  the  severity  of  symptoms  or 
functional  status  of  the  hand,  scale  validity  was  assessed  by  correlating  the  SSS  and  FSS 
scores  with  impairment  measures.142  It  was  hypothesized  that  more  severe  symptoms 
would  be  positively  but  weakly  correlated  with  greater  sensory  and  functional  limitation 
measures.  All  correlations  were  in  the  expected  direction  and  ranged  from  weak  to 
modest.  The  following  Spearman  correlation  coefficients  were  obtained:  grip  strength 
and  pinch  strength=  .38  and  .47  for  the  SSS  and  .50  and  .60  for  the  FSS,  respectively  and 
two-point  discriminations  15  (SSS)  and  .42  (FSS).  Internal  consistency,  as  measured 
by  Chronbach’s  alph,  was  reported  to  be  .89  for  the  SSS  and  .91  for  FSS.  An  effect  size 
of  1. 1  for  the  SSS  and  .71  for  the  FSS  was  obtained  three  months  after  surgery  and 
indicated  that  both  scales  are  sensitive  to  change  in  post-surgical  CTS  patients. 

The  psychometric  properties  reported  by  Levine  et.  al.  for  the  SSS  and  FSS  have  been 
replicated  by  other  authors  in  multiple  clinical  settings,227  study  designs,227  229  and  with 
patients  receiving  worker  compenstation.229  In  addition,  both  the  SSS  and  FSS  have 
been  shown  to  be  more  responsive  than  physical  examination  measures  and  generic  or 
region  specific  patient  self-report  measures. 

Hand  Diagram  &  10-cm  visual  analogue  scale  (VAS):  Pain  drawing  instruments  or 
diagrams  have  proven  useful  for  diagnostic  and  predictive  purposes  in  patients  with 
CTS,16'223’230’231  patients  with  low  back  pain,232’233  and  are  used  primarily  for 
psychological  screening  purposes.  A  pain  diagram  is  usually  administered  by  having  the 
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patient  mark  the  location  of  their  symptoms  on  an  anatomic  diagram  with  symbolic 
markings  descriptive  of  symptoms.  The  hand  diagram  developed  by  Katz  and  Stirrat  for 
use  in  the  evaluation  of  patients  with  CTS  consists  of  anatomic  images  that  depict  the 
anterior  and  posterior  surfaces  of  the  left  and  right  hands  as  well  as  the  entire  left  and 
right  upper  extremities.223  The  hand  images  are  located  in  the  in  the  center  of  the  page 
and  bordered  by  the  upper  extremity  images  on  each  outside  comer.  In  addition,  a  10cm 
VAS  for  pain  intensity  is  included  at  the  bottom  of  the  instrument.  The  10cm  pain  VAS 
has  been  used  extensively  as  an  indicator  of  patient  response  to  treatment  and  possesses 
construct  validity,234’235  is  responsive  to  change,236  and  has  excellent  test-retest  reliability 
(r=  99)  237  The  original  descriptors  used  by  patients  to  complete  the  hand  diagram  were 
pain,  numbness,  tingling,  and  decreased  sensation,223  with  a  marking  symbol  peculiar  to 
each  descriptor.  In  a  later  study  the  authors  collapsed  the  descriptors  of  numbness, 
tingling,  and  decreased  sensation  into  a  single  response  category  because  they  frequently 
overlapped  and  it  was  difficult  for  some  to  distinguish  the  difference  between  these 
descriptors.231  The  diagram  is  graded  by  classifying  the  patient  as  having  classic, 
probable,  possible  and  unlikely  CTS  based  the  areas  of  the  diagram  that  are  marked. 

In  a  sample  of  63  patients  treated  for  upper  extremity  paresthesias,  the  sensitivity  and 
specificity  of  the  Hand  Diagram  was  reported  to  be  .80  and  .90,  respectively.  Intra  and 
Inter-rater  reliability  was  determined  for  54  randomly  selected  Hand  Diagrams  and 
reported  as  a  percent  agreement  of  91%  and  84%,  respectively.  This  initial  report  was 
retrospective  and  CTS  prevalence  was  88%,  which  limited  conclusions  about  the  validity 
of  the  Hand  Diagram.  However,  other  large,  prospective  trials  assessing  the  diagnostic 
accuracy  of  the  Hand  Diagram  alone  and  in  combination  with  other  diagnostic  tests  have 
reported  sensitivities  that  range  from  .61 16  to  .64  230  and  specificities  that  range  from  and 
.71 16  to  .73230,  respectively.  There  is  limited  evidence  that  the  hand  diagram  may  be 
useful  for  prognosis  in  patients  with  CTS  who  are  treated  surgically.231  Later  studies 
included  patients  with  a  variety  of  upper  extremity  disorders,  workers  compensation 
cases,  and  are  of  stronger  methodological  quality.  In  summary,  the  Hand  Diagram 
appears  to  be  a  useful  self-report  instrument  for  the  diagnosis  of  CTS  and  may  be  useful 
for  predicting  outcome  in  surgically  treated  patients. 
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ti,.  Fear  Avoidance  Behavior  Questionnair^FABmi  Lethem  and  Slade  et.  al.  proposed 
the  Fear-Avoidance  Model  of  Exaggerated  Pain  Perception  which  is  based  on  data 
concerning  the  pain  coping  strategies  used  by  patients.238'239  The  model  proposes  that  an 
individual’s  response  to  pain  can  be  described  on  a  continuum,  from  a  minimal  fea 
the  painful  symptoms  and  motivation  to  continue  normal  activities  to  the  greatest  extent 
possible,  to  a  strong  fear  of  painful  symptoms  and  an  avoidance  of  painful  activities. 

Patients  who  respond  in  the  former  manner  may  be  described  as  “confronted”  and  those 
in  the  latter  manner  as  “avoiders”.  Confronted  will  tend  to  rehabilitate  themselves  while 
avoided  become  increasingly  deconditioned  and  disabled  as  a  result  of  their  avoidance 

behavior. 

The  FABQ  is  a  self-report  measure  developed  by  Waddell  et.  al.  in  order  to  measure  the 
fear-avoidance  beliefs  of  low  back  pain  patients.222  The  FABQ  consists  of  16  items  and 
has  a  two  factor  structure.  One  factor  concerns  fear-avoidance  beliefs  related  to  work  (11 
items)  and  the  other  factor  concerns  fear-avoidance  beliefs  related  to  general  physical 
activity  (five  items).  The  FABQ  has  been  demonstrated  to  have  acceptable  psychometnc 

..  222 
properties. 

In  a  prospective  study  of  300  patients  with  acute  low  back  pain,  Klenerman  et.  al.  found 
the  FABQ  and  several  other  indicators  of  fear-avoidance  beliefs  to  be  the  best  predictors 
of  which  patients  condition  would  become  chronic.240  Fritz  et.  al.  found  the  work  FABQ 
to  be  the  best  predictor  of  return  to  work  at  four  weeks  for  a  group  of  67  patients  with 
occupationally  related  acute  LBP:  sensitivity  was  perfect  Sn  (1.0)  and  Sp  was.63  when  a 
cut-off  score  of  30  is  used  (LR+=  2.7,  LR-=  02).241  The  FABQ  has  not  been  used  to 
predict  chronicity  for  patients  with  CR  or  CTS.  The  use  of  the  FABQ  for  patients  with 
CR  and  CTS  in  this  study  is  considered  acceptable  for  the  following  reasons,  the  fear 
avoidance  model  is  based  on  the  coping  strategies  of  patients  with  a  variety  of 
conditions;238'239  the  FABQ  has  acceptable  psychometric  properties;222  and  the  FABQ 
assesses  factors  that  would  not  be  limited  to  patients  with  LBP  (i.e.  fear  of  pain  related  to 
work  and  general  physical  activity)238  239 
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3.0  RESEARCH  HYPOTHESES 


1  Reliability:  the  clinical  examination  variables 

listed  below  will  demonstrate  the 

following  specified  levels  of  reliability 

a.  Excellent  (K  >  .75  or  ICC  >.90) 

CTS  Variables 

CR  Variables 

Questions:  1.- “Most  bothersome  symptoms..”,  1.-  ‘"Most  bothersome  symptoms.. 

2.-  ‘‘Where  most  bothersome.” 

2.-  “Where  most  bothersome.. 

3.-  “Symptom  behavior. .” 

3.-  “Symptom  behavior.” 

4.-  “Hand  fat/swollen.” 

6.-  “Entire  limb  numb  .” 

5.-  ‘Tumbling/dropping..” 

1-  “Symptoms  keep  from  sleep.  ” 

6.-  “Entire  limb  numb.  ” 

9.-  “Neck  movement  improves.  ” 

8  -  “Night  symptoms  wake.  ” 

10.-  “Hand  shaking  improves.” 

11.-  “Worse  with  hand  use  .” 

Wrist  Ratio 

Neck  ROM 

Tinel’s  A  &  B 

b.  Fair  to  Good  (K=  .40  -  .75)  or  Good  (ICC-  .75  -  .90) 

CTS  Variables 

CR  Variables 

Phalen’s 

Spurling’s  A  &  B  Sensation 

CCT 

Shoulder  Abduction  MMT 

Sensation 

Valsalva  MSR  s 

MMT 

Neck  Distraction 

c.  Poor  (K<. 40)  or  Poor  to  Moderate  (ICC  <  75) 

CTS  Variables 

CR  Variables 

ULTT  A  &  B 


ULTT  A  &  B 
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2.  Test  Diagnostic  Accuracy:  the  concurrent  validity  of  the  clinical  examination 
variables  listed  below  will  be  determined  to  be  acceptable  or  unacceptable  based  on  the 
following  criteria.54'55 

a.  Acceptable  (Sn  or  Sp  >  .70  or  LR+  >  2.0  or  LR-  <  .50) 


CTS  Variables 

Questions  2,4,5,8,10,  and  1 1 

Phalen’s 

TinePs  A  and  B 

CCT 

Wrist  Ratio 
Hand  Diagram 


CR  Variables 


Questions  2  and  7 

Sensation 

MMT 

MSR 


Valsalva 

Neck  Distraction 
(Spurling’s  A  and  B 
Shoulder  Abduction 


b.  Unacceptable  (criteria  for  acceptability  not  met) 


CTS  Variables  CR  Variables 

Questions  1,3,  and  6  ULTT  A  and  B  Questions  1,3,  6  and  9 

Sensation  10cm  VAS  ULTT  A  and  B 

MMT  Neck  ROM 

SSS&FSS  NDI 

10cm  VAS 


3.  Test  Predictive  Validity:  As  with  test  diagnostic  accuracy,  the  predictive  validity  of 
the  clinical  examination  variables  listed  below  will  be  determined  to  be  acceptable  or 
unacceptable  based  on  the  following  criteria..54,55 

a.  Acceptable  (Sn  or  Sp  >  .70  or  LR+  >  2.0  or  LR-  <  50) 


CTS  Variables 

Question  6  and  1 0 
Wrist  Ratio 
Hand  Diagram 


CR  Variables 

Questions  6  and  9 
Neck  Distraction 
Shoulder  Abduction 


EMG/NCS 

FABQ 


EMG/NCS 

FABQ 
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b.  Unacceptable  (criteria  for  acceptability  not  met) 


CTS  Variables 

Questions  1-5,8,10,1 1 

Sensation 

MMT 

Tinel’s  A  and  B 
CCT 
Phalen’s 
ULTT  A  and  B 
10cm  VAS 
SSS&FSS 


CR  Variables 

Questions  1-3,  and  7 

Sensation 

MMT 

Neck  ROM 

Spurling’s  A  and  B 

Valsalva 

ULTT  A  and  B 

NDI 

10cm  VAS 


4.  Test  Item  Cluster  (TIC): 

a.  It  is  hypothesized  that  for  both  CTS  and  CR,  a  combination  of  clinical 
examination  variables  and/or  patient  self-report  items  can  be  identified  that  yield 
acceptable  levels  of  diagnostic  accuracy  (based  on  previous  definition  of  acceptable). 

b.  It  is  hypothesized  that  for  both  CTS  and  CR,  a  combination  of  clinical 
examination  variables,  and/or  patient  self-report  items,  and/or  EMG/NCS  findings,  can 
be  identified  that  yield  acceptable  levels  of  predictive  validity  for  type  of  intervention  and 
patient  perception  of  improvement,  (based  on  previous  definition  of  acceptable). 


If  acceptable  test  inter-rater  reliability,  diagnostic  accuracy,  and  predictive  validity  is 
established  for  the  clinical  examination  and  patient  self-report  measures  considered  in 
this  study,  the  benefits  realized  include  but  are  not  limited  to:  interpretable  test  results; 
more  accurate  clinical  decision  making  with  regard  to  diagnosis  and  treatment;  more 
accurate  estimation  of  patient  prognosis;  and  a  substantial  reduction  in  medical  costs  and 
patient  discomfort.  Acceptable  test  reliability  and  validity  will  allow  further  research  of 
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the  predictive  validity  of  these  techniques,  permit  their  wide  application  in  patient 
outcomes  research,  and  allow  their  confident  application  in  clinical  practice. 

4.0  RESEARCH  DESIGN  AND  METHODS 

This  was  a  multi-center,  prospective,  descriptive  study  designed  to  quantify  the 
reliability,  diagnostic  accuracy,  and  predictive  validity  of  commonly  used  clinical 
examination  and  patient  self-report  measures  used  to  diagnose  patients  with  suspected 
CR  and  CTS. 

4.1  Inclusion  Criteria 

To  be  eligible  for  participation  in  this  study,  individuals  must  have  been  aged  18-60 
years  and  referred  for  EMG/NCS  testing  to  rule-out  CR  and/or  CTS.  Only  patients 
judged  by  the  EMG  lab  evaluating  physician  to  have  signs  and  symptoms  compatible 
with  cervical  radiculopathy  or  CTS  were  eligible  to  participate.  In  addition,  the  patient's 
current  episode  of  symptoms  was  required  to  exceed  4  weeks  but  not  12  and  24  months 
duration  for  CR  and  CTS,  respectively.  Patients  with  the  following  conditions  were 
disqualified  from  study  participation: 

1 .  Systemic  disease  known  to  cause  a  generalized  peripheral  neuropathy. 

2.  Primary  complaint  of  bilateral  radiating  arm  pain 

3 .  History  of  conditions  involving  the  affected  upper  extremity  which  might  adversely 
affect  the  individual’s  level  of  function 

4.  Off  work  for  >6  months  due  to  the  condition. 

5.  Previous  history  of  surgical  procedures  for  pathologies  giving  rise  to  neck  pain  or  for 
CTS 

6.  Patients  who  have  had  previous  EMG/NCS  testing  of  their  symptomatic  limb  for  CR 
and/or  CTS. 

All  consecutive  patients  referred  to  the  EMG  laboratories  of  both  Montefiore  and 
Presbyterian  Hospital  for  EMG/NCS  testing  to  rule-out  CR  and/or  CTS  received 
information  about  the  study  and  complete  a  study  screening  form  (appendix  A).  If,  after 
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examining  the  patient  and  reviewing  the  screening  form,  the  EMG  lab  provider 
determined  if  the  patient  was  eligible  for  study  participation,  the  patient  was  asked  by  the 
provider  to  participate  in  the  study.  Prior  to  obtaining  informed  consent,  the  study 
investigators  or  their  representative  at  distant  participating  sites  explained  the  study  in 
detail  to  the  subject.  If  the  patient  agreed  and  gave  informed  consent  in  compliance  with 
the  standards  of  the  Biomedical  Internal  Review  Board  of  the  University  of  Pittsburgh  or 
the  Internal  Review  Board  of  the  other  respective  participating  facilities,  he  or  she  was 
admitted  into  the  study.  Volunteers  were  also  be  recruited  from  the  EMG  laboratories  of 
the  following  participating  Military  Treatment  Facilities:  The  National  Naval  Medical 
Center  (NNMC),  Bethesda,  MD;  Wilford  Hall  USAF  Medical  Center  (WHMC),  San 
Antonio,  TX;  and  the  Air  Force  Academy  Hospital  (AFAH),  Colorado  Springs,  CO. 

4.2  Methods 

42.1  Procedure 

A  video  tape  of  all  clinical  examination  procedures  and  the  disto-proximal  NCS 
technique  as  well  as  a  clinical  examination  handbook  and  an  EMG  handbook  detailing 
the  performance  of  each  clinical  examination  measure,  equipment  settings,  and 
EMG/NCS  procedures  was  distributed  to  each  participating  center  prior  to  data 
collection.  All  physical  therapist  raters  at  each  participating  facility  viewed  the  tape  and 
read  the  clinical  examination  handbook  in  order  to  familiarize  themselves  with  the 
clinical  examination  measures.  In  addition,  all  raters  participated  in  at  least  two  practice 
sessions  during  which  all  clinical  examination  measures,  except  the  asking  of  questions, 
were  performed.  Physical  therapist  raters  practiced  applying  the  specified  amount  of 
compression  or  distraction  force  required  for  the  Spurling  test,  distraction  test,  and  CCT 
using  a  bathroom  scale,  mechanical  traction  device,  and  pinch  gauge,  respectively.  All 
EMG  providers  viewed  the  tape  and  read  the  EMG/NCS  handbook  in  order  to  familiarize 
themselves  with  the  disto-proximal  NCS  procedure,  EMG/NCS  equipment  parameters, 
and  procedure  protocol. 

Once  a  patient  was  determined  to  be  eligible  and  agreeded  to  participate  in  the  study,  the 
patient  underwent  a  standardized  EMG/NCS  examination  of  the  affected  upper  quarter 
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and  completed  the  following  self-report  instruments  which  are  listed  in  appendix  B:  NDI, 
FSS,  SSS,  FABQ,  VAS,  and  Hand  Diagram.  The  electrophysiologic  examination 
consisted  of  established  EMG/NCS  procedures.67'1 1264pp  541-555  All  EMG/NCS  testing  was 
performed  by  a  physician,  physical  therapist  or  evoked  potential  technician  with 
electrophysiologic  testing  credentials. 

Within  one  week  following  the  standardized  EMG/NCS  examination,  the  patient 
underwent  two  standardized  clinical  examinations  administered  by  two  physical  therapist 
raters.  The  second  examination  was  required  in  order  to  determine  the  reliability  of  the 
clinical  examination  measures  used  in  this  study.  Therapist  raters  1  &  2  were  blinded  to 
the  patient’s  diagnosis  or  suspected  condition.  Rater  1  obtained  responses  to  the  1 1 
questions  related  to  the  patient’s  symptoms  and  performed  the  clinical  examination 
measures  with  all  participating  patients.  If  any  reproduction  or  increase  in  the  patient's 
symptoms  occurred,  the  therapist  allowed  the  symptoms  to  return  to  baseline  before 
administering  the  next  test  procedure.  Each  clinical  examination  procedure  was  graded 
or  interpreted  as  previously  described.  Following  a  five-minute  rest  period,  a  second 
rater  (rater  2)  re-administerd  the  clinical  examination  measures  to  the  patient  in  an 
identical  manner.  The  1 1  provocation  tests  were  administered  in  alternating  order  with 
each  new  patient  to  control  for  order  effects.  Rater  2  did  not  administer  the  1 1  questions 
of  history  to  the  patient  prior  to  the  examination  but  obtained  patient  responses  2-3  days 
following  the  examination.  The  1 1  questions  of  history  were  administered  to  the  patient 
by  Rater  2  at  the  next  follow-up  visit  or  by  telephone.  The  delay  in  obtaining  responses 
to  questions  of  history  by  Rater  2  was  required  to  prevent  item  recall  by  the  patient, 
which  could  confound  the  interpretation  of  reliability.  The  clinical  examination  results 
obtained  by  the  first  PT  rater  were  used  for  all  computations  of  diagnostic  test  accuracy. 

All  patients  were  mailed  a  15-point  GRCS  six-weeks  from  the  date  of  their  initial  clinical 
examination  and  were  asked  to  rate  their  improvement.  Patients  were  also  mailed  a 
treatment  form  and  asked  to  document  whether  they  had  surgical  intervention  and  list  all 
non-operative  treatment  interventions  they  had  received  since  their  initial  examination. 

All  clinical  evaluation  forms  are  contained  in  appendix  C. 
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4.2.2  Patient  Demographic  Data  and  Past  Medical  History 

The  following  demographic  data  was  collected:  age,  gender,  specialty  from  which 
referred,  workers  compensation  and  litigation  status,  and  employment  status.  Past 
medical  history  data  was  collected  and  included  any  previous  or  existing  medical 
conditions,  risk  factors  for  generalized  peripheral  neuropathy,  information  related  to  the 
onset  and  duration  of  the  current  episode  of  symptoms,  and  whether  or  not  the  patient  has 
had  previous  evaluation  and  treatment  for  the  current  condition.  The  EMG/NCS  provider 
documented  his  or  her  suspected  diagnosis  for  the  patient  as  well  as  the  diagnosis 
suspected  by  the  referring  provider.  In  addition,  the  EMG/NCS  provider  reviewed  the 
patient’s  medical  record  and  documented  the  findings  of  any  available  imaging  studies, 
prescribed  medication,  and  conservative  treatments  related  to  the  patient’s  condition. 

4.2.3  Standardized  Electrophvsiologic  Examination 

Surprisingly,  little  has  been  published  to  document  the  reliability  of  either  standard  NCS 
measurements  (latency,  velocity,  amplitude)  for  the  median  and  ulnar  nerve244  or  the 
needle  EMG  examination.  Two  studies  that  used  analysis  of  variance  and  paired  t-tests 
found  no  differences  in  latency  means  in  test-retest  studies244'246  but  this  approach  is 
inadequate  for  establishing  reliability.247  In  one  recent  unpublished  study,  Moore  et  al. 
found  excellent  intrarater  reliability  within  a  single  measurement  session  for  both  distal 
sensory  latencies  (DSL)  (ICC  2, 1  —  0.98)  and  distal  motor  latencies  (DML)  (ICC  2,1  — 
0.98). 248 

A  reliability  coefficient  has  not  been  reported  for  the  intra  or  inter-rater  reliability  of  the 
needle  EMG  examination.  However,  needle  EMG  is  the  most  sensitive 
electrophysiologic  procedure  for  detecting  axonal  loss  occuring  in  cervical  and  lumbar 
radiculopathies73'88  and  multiple  studies  have  documented  its  strong  positive  association 
with  myelography,  computed  tomography,  and  surgical  findings  (percent  agreement^ 
75%,  (95CI  61%  95%)70'71,  89%,  (95CI  80%-98%)249,  and  78%-90%73'73'77’77'78 
respectively).  Because  of  its  strong  association  with  multiple  other  diagnostic  studies  and 
surgical  observation,  the  reliability  of  the  EMG  examination  may  be  considered 
acceptable. 
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Following  a  history  and  neuromusculoskeletal  screening  examination,  the  EMG/NCS 
tester  (Tester)  thoroughly  explained  all  EMG/NCS  testing  procedures  to  the  patient  and 
answered  any  questions.  The  patient  was  asked  to  lie  supine  on  an  examination  table 
with  the  symptomatic  limb  toward  the  Tester.  The  temperature  of  the  limb  to  be  tested 
was  assessed  using  a  standard  surface  thermistor  placed  in  the  palm  of  the  hand  to  be 
tested  at  the  level  of  the  metacarpal  head.  Hand  temperature  was  >32°  C  prior  to  NCS 
testing.  The  area  over  which  the  electrodes  were  placed  will  was  cleansed  with  an 
alcohol  swab  in  order  to  decrease  skin  impedence.  If  the  patient’s  hand  temperature  was 
<  32°  C,  the  hand  was  placed  in  water  warmed  to  34  -  40°  C  and  reassessed  until  hand 
temperature  reached  the  acceptable  limit. 64pp'29'64 

All  EMG/NCS  units  had  a  current  equipment  safety  rating  prior  to  use.  The  instrument 
settings  listed  below  were  used  as  default  parameters  for  the  respective  test  procedures. 
Equipment  settings  were  changed  in  order  to  obtain  clear  and  interpretable  test  responses 
when  technical  difficulties  were  encountered.  Any  changes  made  to  default  parameters 
during  testing  were  documented. 
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Electromyograph  Instrument  Parameters 


A.  Orthodromic  evoked  SNAP/CNAP: 

C.  H-Reflex 

1)  Gain: 

1)  Gain:  500  -  l,000uV/division 

SNAP  20uV/division, 

2)  Sweep  speed:  5ms/division 

CNAP  20-50uV/division 

3)  Filter:  20-  10,000  Hz 

2)  Sweep  speed:  lms/division 

3)  Filter:  20  -  5000  Hz 

4)  Stimulus  duration:  .5  ms 

4)  Stimulus  duration:  .  1  ms 

D  EMG 

1 )  Gain: 

B.  Evoked  CMAP 

Instertional  & 

1)  Gain.  2mV/division 

spontaneous-  50  -lOOuV/division 

2)  Sweep  speed:  2ms/division 

Recruitment-  l,000uV/division 

3)  Filter:  20-  10,000  Hz 

2)  Sweep  speed: 

4)  Stimulus  duration:  .  1  ms 

Insertional  & 

spontaneous- 1 0ms/ division 

Recruitment-  lOms/division  and 

lOOms/division 

3)  Filter:  20  -  10,000  Hz 

Commercially  available  tape,  conductive  gel,  surface  bar,  and  surface  disc  electrodes 
were  used  for  nerve  conduction  studies.  All  electrode  surfaces  were  wiped  with  alcohol 
between  patients.  Commercially  available  disposable  40mm  or  50mm  monopolar  needle 
electrodes  were  used  for  all  EMG  testing.  Used  electrodes  were  disposed  of  in 
receptacles  designated  and  approved  for  such  use  (i.e.  sharps  bucket). 

Nerve  conduction  studies  were  performed  first,  followed  by  needle  electromyography. 
All  distances  used  for  electrode  placement  and  to  calculate  NCV  were  measured  along 
the  anatomic  course  of  the  nerve  with  a  tape  measure  and  recorded  in  millimeters. 
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Nerve  Conduction  Procedures:  The  stimulator  was  set  to  zero  prior  to  each  nerve 
conduction  test.  For  each  nerve  study,  the  patient  was  notified  prior  to  nerve  stimulation. 
Several  stimuli  of  gradually  increasing  intensity  were  delivered  until  a  maximal  response 
was  obtained. 64pp29'64  Evoked  response  parameters  were  measured  and  recorded  for  each 
response  in  the  following  manner: 

1.  Amplitude  (microvolts)  -  peak-to-peak  for  SNAP/CNAP  responses  and  baseline-to- 
peak  for  motor  responses 

2.  Latency  (milliseconds)-  peak  for  SNAP/CNAP  responses  and  departure  from  baseline 
for  motor  responses. 

3.  Nerve  conduction  velocity  (M/s)-  NCV  is  the  quotient  obtained  by  dividing  the  nerve 
segment  distance  in  millimeters  by  the  relevant  nerve  segment  latency  in  milliseconds. 
For  median  and  ulnar  motor  nerves,  the  relevant  nerve  segment  latency  is  first 
obtained  by  subtracting  the  distal  motor  latency  from  the  proximal  motor  latency. 

The  following  NCS  protocol  was  performed  in  order  and  in  a  standard,  previously 
reported  fashion.  U3'250’64pp29'64 

1.  Median  and  ulnar  nerve  orthodromic  palmar  CNAP  @  8.0cm  (latency  &  amplitude) 

2.  Median  SNAP  distal-proximal  ratio  (latencies  &  NCV’s): 

Stimulation  site:  Ring  electrodes  placed  on  the  third  digit  (D3),  cathode  proximal 
Recording  site  1 :  midpalm-  After  obtaining  an  evoked  potential  the  ring  to  D3 
latency  is  recorded.  Next,  the  NCV  for  the  distal  segment  (D3  to  palm)  is 
calculated  by  dividing  the  measured  distance  by  the  latency. 

Recording  site  2:  proximal  wrist  crease-  After  obtaining  an  evoked  potential,  the 
proximal  segment  latency  (midpalm  to  proximal  wrist  crease)  is  obtained  by 
subtracting  the  distal  segment  latency  (obtained  in  the  previous  step)  from  the  ring 
to  prox  wrist  crease  latency.  NCV  for  the  midpalm  to  proximal  wrist  crease 
segment  latency  is  then  calculated.  This  is  done  by  subtracting  the  distal  latency 
(midpalm  to  D3)  from  the  proximal  latency  (midpalm  to  proximal  wrist  crease). 
The  NCV  for  the  proximal  segment  is  then  calculated  by  dividing  the  measured 
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midpalm  to  proximal  wrist  crease  segment  distance  by  the  calculated  latency  for 
that  segment. 

Calculate  the  distal-proximal  ratio:  Divide  the  NCV  of  the  D3  to  mid-palm 
segment  by  the  NCV  of  the  mid-palm  to  proximal  wrist  crease  segment  to  obtain 
a  proportion. 

3.  *Ulnar  orthodromic  SNAP  @  14cm  (latency  &  amplitude)-  done  only  if  palmar  CNAP 
is  prolonged  or  technically  unobtainable 

4.  Median  and  Ulnar  distal  CMAP  @  8cm  (latency  &  amplitude) 

5.  Median  and  Ulnar  NCV  (latency,  amplitude,  and  NCV) 

Forearm  segment-  median  and  ulnar  nerve 
Elbow  segment-  ulnar  nerve 

6  *Median  and  Ulnar  F-wave  (latency)-  done  only  in  the  absence  of  motor  latency 
abnormalities  or  NCV  abnormalities  for  each  respective  nerve. 

7.  FI-Reflex-  record  flexor  carpii  radialis  affected  side  (latency) 

8.  *If  the  median  orthodromic  CNAP  or  median  distal-proximal  NCV  ratio  is  abnormal, 
then  step  1)  and  2)  will  be  repeated  on  the  opposite  hand.  If  the  median  study  of  the 
asymptomatic  side  is  abnormal,  then  step  4)  will  be  repeated. 

9.  FI-Reflex-  record  flexor  carpi  radialis  opposite  side  (latency) 

*  Conditional  procedures 

Needle  Electromyography  Procedures:  The  skin  of  the  limb  to  be  sampled  was  cleansed 
with  an  alcohol  wipe  prior  to  needle  electrode  insertion.  Each  of  the  following  muscles 
was  examined  for  insertional,  spontaneous,  and  recruitment  activity  in  the  following 
manner:  mid  and  lower  cervical  paravertebral  muscles,  deltoid,  triceps  brachii,  extensor 
carpi  radialis  longus/brevis,  flexor  carpi  radialis,  abductor  pollicus  brevis,  and  first  dorsal 
interrosseus. 

1 .  Insertional  activity-  Observed  and  recorded  as  normal,  increased,  decreased,  for  each 
muscle  sampled. 

2.  Spontaneous  activity-  For  each  muscle  site  sampled,  the  tester  utilized  the  standard 
quadrant/level  method  for  a  total  of  12  observations  at  each  sampling  site.64pp  29-64  Care 
was  taken  so  that  no  electrode  movement  occured  when  making  a  determination  of  the 
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presence  or  absence  of  spontaneous  activity.  Spontaneous  activity  in  the  form  of 
fibrillations  and  positive  sharp  waves  (PSW)  was  graded  1+,  2+,  3+,  or  4+  The  presence 
and  type  of  other  spontaneous  wave-form  activity  was  documented  as  appropriate. 

3.  Motor  unit  analysis-  Motor  unit  action-potential  (MUAP)  activity,  consisting  of 
recruitment  and  morphology,  was  assessed  at  least  once  for  each  limb  muscle  site 
sampled  at  a  gain  of  both  lOOuv  and  l,000uv;  MUAP  activity  of  paracervical  muscles 
was  not  be  assessed.  The  recruitment  frequency/ number  (ratio)  method  was  used  to 
assess  MUAP  recruitment.  The  assessment  of  MUAP  morphology  was  made  when  rise 
time  was  maximal  (<500us)  and  included  both  number  of  phases  and  amplitude.  Motor 
unit  morphology  was  graded  as  normal  or  increased  polyphasic,  and  motor  unit 
amplitude  was  graded  as  normal,  increased,  or  decreased. 

Additional  Procedures:  Other  EMG/NCS  procedures  or  additional  muscle  sampling  were 
performed  as  indicated  from  the  clinical  examination  and  were  based  on  the  Tester’s 
opinion.  The  EMG  provider  documented  all  additional  EMG/NCS  procedures  performed 
and/or  additional  muscles  sampled. 
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Grading  &  Interpretation:  For  NCS  procedures,  the  following  previously  established 
normal  values  listed  in  Table  2  were  used  in  this  study  : 


Table  5:  Type  of  NCS  Studies  and  Associated  Normal  Values 


STUDY  PARAMETERS 

Latency  (ms)  Amplitude  (uV)  NCV  (M/s) 


STUDY 

1 .  Median  and  Ulnar  midpalmar  CNAP  @  8.0cm. 

2.  Median  SNAP  distal-proximal  ratio: 

a.  3,d  digit  to  midpalm  (NCV) 

b.  midpalm  to  wrist  (NCV)  (10cm  separation 
of  midpalm  and  wrist  cathode  recording 
sites) 

c.  Calculate  distal-proximal  NCV  ratio  by 
dividing  a.  NCV  by  b.  NCV 

3.  Ulnar  SNAP  @  14cm. 

4.  Median  and  Ulnar  distal  CMAP 

5.  Median  and  Ulnar  F-wave 

6.  Median  and  Ulnar  NCV 

median  and  ulnar  nerve  in  forearm 
ulnar  nerve  across  elbow 

7.  H-Reflex- flexor  carpii  radialis 


<2.2  or 

Med  >40 

<3  med/uln  diff 

Uln  >11 

- 

Ratio  <1.0 

<3.7 

>12 

>4.3  Med 

£5000 

__ 

>3.6  Uln 

>5000 

>32.0  or 

^.5ms  med/uln  diff 

- 

- 

-- 

>5000; 

<20%  drop  from 
prox/dist.  stim  site 

>50 

<19.0  or 

^  1.0ms  R/L  diff  or 
^calculated  latency 

“ 

— 

(.29  +  ,1905(arm  length  cm)±.84) 

Each  NCS  was  graded  as  abnormal  if  it  exceeded  normal  values  for  that  study. 
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For  needle  EMG  procedures,  the  following  previously  established  criteria  were  used  to 
grade  insertional  activity,  spontaneous  activity  (fibrillations  and  PSWs),  and  MUAPs: 

1 .  Insertional  Activity: 

Normal-  electrical  activity  persists  no  longer  than  50ms  following  cessation  of 
needle  electrode  movement. 

Increased-  electrical  activity  persists  longer  than  50ms  following  cessation  of 
needle  electrode  movement 

Decreased-  few  if  any  electrical  potential  detected  during  or  following  needle 
electrode  movement 

2.  Spontaneous  Activity:  Graded  in  accordance  with  Figure  1 1  below. 


Figure  11:  Spontaneous  Activity  Characteristics  and  Grading 


Grading 


Characteristics 


0  No  fibrillations  or  PSW 

1+  Persistent/unsustained  single  trains  in  at  least  two  sites  of  muscle  sampled 

2+  Moderate  numbers  in  three  or  more  sites  of  muscle  sampled 

3+  Many  in  all  muscle  sites  sampled 

4+  Baseline  obliterated  with  fibrillation  potentials  in  all  muscle  sites  sampled 


Other  types  of  spontaneous  activity  consistent  with  denervation,  when  observed, 
were  documented. 

3.  Motor  Unit  Action  Potential  (MUAP)  Recruitment  Analysis: 

Normal-  frequency  of  preceding  motor  unit  5-10Hz  prior  to  recruitment  of  a 
successive  motor  unit. 

Decreased-  Ratio  of  fastest  firing  MUAP  and  number  MUAPs  observed  >10. 
Increased-  Ratio  of  fastest  firing  MUAP  and  number  MUAPs  observed  <  3. 


4.  MUAP  Morphology  Analysis: 
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Waveform  morphology  was  observed  with  the  needle  electrode  in  close  proximity  to 
MUAP  as  evidenced  by  crisp  sound  and  MUAP  rise  time  <500us. 

Normal-  MUAPs  consisting  of  2-5  phases  of  #10  -  15ms  in  duration  and  <6000uV 
amplitude 

Abnormal-  Multiple  MUAPs  present  with  6  or  more  phases  >  10  -  15ms  duration. 
Multiple  MUAPs  with  amplitude  >  6000uV,  a  combination  of  the  previous  two 
finding,  or  most  MUAP’s  with  amplitude  <1000uV  amplitude. 

Classification:  The  results  of  the  EMG/NCS  examination  consistent  with  CR  and 
consistent  with  CTS  were  used  to  classify  patients  according  to  severity  of  findings  for 
each  respective  condition. 

1 .  Normal-  No  abnormalities  noted 

2.  Unilateral  median  nerve  (CTS)  abnormalities:127 

Mild-  any  abnormal  median  sensory  latency  or  disto-proximal  ratio.  All  other 
sensory  and  motor  NCS  parameters  normal. 

Moderate  -  abnormal  sensory  or  disto-proximal  ratio  and  distal  motor  latency. 
CNAP  amplitude  may  be  diminished  but  >50%  of  normal.  Motor  NCV  normal. 
Pronounced-  abnormal  sensory  and  distal  motor  latency.  CNAP  amplitude  <50% 
of  normal.  CMAP  amplitude  may  be  diminished  but  >50%  of  normal.  Mild 
slowing  of  forearm  NCV  may  be  present  (>45  M/s)  and  spontaneous  activity  may 
be  noted  on  EMG  exam. 

Severe-  Absent  CNAP,  abnormal  distal  motor  latency,  CMAP  amplitude  <50%  of 
normal  or  absent.  Mild  (>45  M/s)  slowing  of  forearm  NCV  may  be  present  and 
EMG  abnormalities  are  present. 

3.  Bilateral  median  (CTS)  abnormalities  (each  hand  classified  as  above  according  to 
severity). 

4. *  Classification  number  2  or  3  above  with  concomitant  ulnar  nerve  abnormalities 

5.  Radiculopathy  abnormalities: 

Mild-  H-reflex  abnormality  alone  and/or  1+  spontaneous  activity  in  one  or  more 
muscles.  Other  EMG/NCS  parameters  normal 
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Moderate-  2+  -  3+  spontaneous  activity  in  two  or  more  muscles.  Increased 
recruitment,  polyphasicity,  and  increased  amplitude/duration  of  some  MUAP’s 
may  be  observed 

Severe-  3+  -  4+  spontaneous  activity  in  two  or  more  muscles.  Either  increased 
recruitment  ratio,  polyphasicity,  or  increased  amplitude/duration  of  many 
MUAP’s  is  observed. 

6. **  Radiculopathy  with  concomitant  CTS  (double  crush;  both  conditions  classified 
according  to  their  respective  severity  scales) 

7. **  Other:  EMG/NCS  studies  consistent  with  other  peripheral  neuropathy  or  myopathy. 

*  Subjects  classified  in  groups  4  &  7  will  be  eliminated  from  the  study  based  on 
inclusion/exclusion  criteria. 

**Data  from  subjects  classified  in  group  6  will  be  analyzed  separately  in  a  descriptive 
fashion. 

4.2.4  Diagnostic  Tests 

4,2.4. 1  Clinical  Examination  Procedures 

Questions  of  History:  All  patients  were  asked  1 1  questions  of  history  related  to  their 
signs  and  symptoms  in  the  manner  previously  described.  These  1 1  questions  along  with 
their  respective  possible  responses  are  listed  in  appendix  C. 

Conventional  Neurological  Examination  of  the  Upper  Extremity: 

Strength  testing  was  conducted  through  manual  muscle  testing  of  the  deltoid  (C5),  biceps 
brachii  and  extensor  carpi  radialis  longus/brevis  (C6),  triceps  brachii  and  flexor  carpi 
radialis  (C7),  abductor  pollicus  brevis  (C8),  and  dorsal  interossei  (Tl).  All  manual 
muscle  testing  was  conducted  using  the  methods  of  Kendall  and  McCreary  and 
performed  with  the  subject  sitting.251  The  deltoid  was  tested  by  resisting  shoulder 
abduction.  The  biceps  brachii  was  tested  by  resisting  elbow  flexion  with  the  forearm 
supinated.  The  extensor  carpi  radialis  longus/brevis  was  tested  by  resisting  wrist 
extension  from  a  neutral  forearm  position  and  90°  elbow  flexion.  The  flexor  carpi 
radialis  was  tested  by  resisting  wrist  flexion  from  a  neutral  forearm  position  and  90° 
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elbow  flexion.  The  triceps  brachii  was  tested  by  resisting  elbow  extension  from  a 
position  of  90°  of  elbow  flexion.  The  abductor  pollicus  was  tested  with  the  forearm 
supinated  and  wrist  in  a  neutral  position.  The  first  dorsal  interossei  was  tested  by 
resisting  abduction  of  the  first  finger.  The  result  of  each  muscle  test  was  graded  as 
“absent”,  “markedly  reduced”,  “reduced”,  or  “normal” 

Muscle  stretch  reflexes  of  the  bicep  (C5-6),  brachioradialis  (C5-6),  and  Triceps  (C7)  were 
tested  bilaterally  using  a  standard  reflex  hammer.  The  result  of  each  muscle  stretch 
reflex  was  graded  as  “absent”,  “reduced”,  “normal”,  or  “hyper/increased”  as  compared 
to  the  unaffected  extremity  . 

Sensation  testing  was  performed  by  testing  sensitivity  to  light  touch  for  the  different 
cervical  dermatomes  (C5-C8)  and  discrete  areas  of  median  nerve  cutaneous  distribution 
(palmar  surface  of  digits  1-3).  Testing  was  performed  by  having  the  examiner  touch  the 
skin  in  a  key  area  for  each  respective  sensory  level  with  a  disposable  paper  clip  that  was 
discarded  following  testing.  A  new  paper  clip  was  used  for  each  patient.  Each 
dermatome  level  of  the  right  and  left  upper  limb  was  tested  sequentially.  The  C5 
dermatome  was  tested  over  the  deltoid  muscle,  C6  along  the  radial  aspect  of  the  second 
metacarpal  and  index  finger,  C7  on  the  mid-posterior  forearm  and  dorsal  aspect  of  the 
middle  finger,  and  C8  along  the  medial  border  of  the  5th  finger.  The  discrete  areas  of 
median  nerve  cutaneous  distribution  were  tested  by  comparing  the  palmar  cutaneous 
distribution  of  digits  1  -3  with  the  cutaneous  distribution  of  the  thenar  eminence  and 
midpalm  area.  The  result  of  each  sensory  test  was  graded  as  “absent”,  “reduced”, 
“normal”,  or  “hyperesthestic”  in  comparison  to  the  unaffected  extremity. 

Range-of-Motion  and  Wrist  Diameter  Measurements:  The  cervical  ROM  measures 
obtained  in  this  study  include:  flexion,  extension,  bilateral  side-bending,  and  bilateral 
rotation  and  were  obtained  in  the  following  manner:  While  seated  in  a  chair  and  prior  to 
measurement  by  a  physical  therapist  rater,  the  patient  was  asked  to  assume  a  neutral  neck 
position  satisfactory  to  both  the  patient  and  examiner.  Once  an  acceptable  neutral 
position  has  been  assumed,  the  therapist  applied  a  piece  of  colored  tape  to  the  wall  at  eye 


74 


level.  This  was  referred  to  by  the  therapist  as  the  "neutral  position".  The  patient  was 
then  asked  to  perform  warm-up  movements  consisting  of  two  repetitions  in  each  motion 
direction.  Immediately  following  the  warm-up  procedure,  the  rater  recorded  a  single 
ROM  measurement  for  flexion,  extension,  and  bilateral  side-bending  using  a  bubble 
inclinometer  as  described  by  Hole.170  Bilateral  rotation  was  measured  using  a  standard 
long-arm  goniometer 

The  wrist-ratio  index  is  a  proportion  derived  from  the  ratio  of  anterior-posterior 
(numerator)  and  medial-lateral  (denominator)  wrist  width  measured  in  centimeters.  A 
single  pair  of  sliding  calipers  was  used  to  measure  both  anterior-posterior  and  medial- 
lateral  wrist  width.  From  these  measurements  a  wrist  ratio  index  was  computed. 

Provocation  Tests:  Provocative  tests  were  performed  sequentially  according  to 
operational  definition.  The  starting  order  for  testing  was  varied  in  a  systematic  fashion  to 
prevent  the  confounding  influence  of  order  effects.  Starting  with  the  first  subject,  the 
Rater  began  the  clinical  examination  by  administering  the  first  test  or  measure  on  the 
testing  list.  For  the  second  subject,  the  rater  began  with  the  second  clinical  examination 
measure  on  the  testing  list.  This  procedure  was  continued  for  each  successive  subject. 

An  increase  or  decrease  in  symptoms  refered  to  the  symptoms  associated  with  the 
patient’s  condition,  not  discomfort  or  pain  associated  with  the  test  procedure  that  is 
unrelated  to  the  patient’s  condition.  The  following  phrase  was  used  when  the  patient  was 
questioned  regarding  the  influence  of  a  test  procedure  on  their  symptoms:  "Did  that 
increase  or  decrease  your  symptoms  in  any  way?" 

The  following  provocative  tests  were  performed  in  this  study: 

1.  Neck  Compression  Test  (method  A) 

2.  Neck  Compression  Test  (method  B) 

3.  Distraction  Test 

4.  Shoulder  Abduction  Test 


5.  Valsalva  Maneuver 

6.  Upper  Limb  Tension  Test  (ULLT 
A) 
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7.  Upper  Limb  Tension  Test  (ULLT  B)  10.  Phalen’s  Test 

8.  Tinel’s  Sign  (method  A)  11.  Carpal  Compression  Test 

9  Tinel’s  Sign  (method  B) 

4. 2, 4, 2  Patient  Self-Report  Measures:  Prior  to  the  EMG/NCS  examination,  patients 
completed  the  following  self-report  measures:  NDI,  Brigham  and  Women’s  Hospital 
SSS/FSS,  Hand  diagram  and  10cm  VAS;  and  FABQ. 

4,2,5  Patient  Outcome  Gold  Standard 

At  six-weeks,  a  follow-up  form  was  mailed  to  all  patients.  In  addition  to  the  GRCS,  the 
form  listed  questions  and  corresponding  responses  inquiring  about  the  patient’s  surgical 
and  conservative  treatment  history  since  enrollment  in  the  study. 

All  self-report  forms,  including  the  follow-up  form,  are  listed  in  appendix  B 

4.3  Data  Analysis 

In  addition  to  the  analyses  of  diagnostic  accuracy  and  predictive  validity  described 
below,  descriptive  statistics  (mean,  frequency,  range,  and  standard  deviation)  were 
computed  for  all  variables  of  the  clinical  examination,  patient  self-report  instruments,  and 
EMG/NCS  findings,  dependent  upon  the  appropriate  scale  of  measurement. 

4,3,1  Hypothesis  #1  -  Reliability 

The  first  hypothesis  to  be  tested  is  the  inter-rater  reliability  of  all  clinical  examination 
items.  Reliability  coefficients  for  the  patients  self-report  measures  used  in  this  study 
have  been  previously  reported  and  will  not  be  reassessed. 

Reliability  has  been  defined  as  the  consistency  of  a  measurement  when  all  conditions  are 
thought  to  be  held  constant.252  Reliability  reflects  the  degree  to  which  a  score  is  free 
from  errors  of  measurement  and  may  be  described  as  the  percentage  of  score  that  is 
information  (signal)  as  opposed  to  random  error  (noise).253  A  reliable  test  or  measure  has 
at  least  three  aspects:  1 .  Repeated  measurement  should  be  expected  to  repeat  the  same 
score  on  two  different  occasions;  2.  Measures  obtained  can  be  depended  on  to  give  a 


76 


close  approximation  of  the  true  score;  3 .  Allows  one  to  generalize  what  will  occur  on 
future  measurement  occasions.254  Reliability  is  a  prerequisite  for  validity.252  Therefore, 
validity  can  only  be  meaningfully  interpreted  when  a  measure  is  reliable. 

The  use  of  an  unreliable  measure  may  result  in  several  undesirable  consequences. 
Unreliable  measures  will  attenuate  correlations  between  variables  and  thereby  diminish 
the  ability  to  detect  a  relationship  if  one  exists.  A  direct  result  of  this  attenuation  is  the 
need  for  increased  sample  sizes  to  obtain  a  significant  effect  in  clinical  trials.  Unreliable 
measures  will  also  contribute  to  biased  samples.255  Strube  has  described  a  number  of 
different  reasons  that  may  cause  a  test  or  measure  to  be  unreliable.  Sources  of 
unreliability  include:  examiners  perform  the  test  or  measure  differently;  examiners 
perform  the  test  or  measure  similarly  but  different  standards  are  used  as  anchor  points; 
and  examiners  enter  data  differently,  resulting  in  coding  errors.256  Miscommunication 
and  lack  of  understanding  may  also  contribute  to  unreliability. 

Another  reason  for  unreliability  is  the  lack  of  variability  in  the  item  of  interest. 

Reliability  indices  (rxx)  are  a  ratio  of  the  variance  of  interest  over  the  sum  of  the  variance 
of  interest  plus  error  as  depicted  below.254  It  is  clear  from  this  ratio  that  in  order  for  an 
index  of  reliability  to  be  interpretable,  there  must  first  be  variability  to  explain. 

Fxx  =  true  score  variablitv 

true  score  variability  +  error  variability 

Finally,  unreliability  may  result  from  disagreement  among  the  raters.  In  this  latter  case, 
there  may  be  no  way  to  modify  the  procedure  in  order  to  achieve  reliability  and  the 
measure  will  no  longer  be  useful.254 

Reliability  estimates  rely  on  measurement  models  and  their  assumptions.  The 
measurement  scale  of  the  data  determine  which  model  is  appropriate  for  obtaining  a 
reliability  coefficient.254  Bartko  has  described  three  approaches  for  estimation  of  the 
reliability  of  nominal  or  categorical  data.257  The  first  is  descriptive  and  merely  computes 
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the  percentage  of  agreement  between  raters.  A  second  approach  is  to  use  a  coefficient  of 
association  such  as  the  Phi  or  Spearman  Rho  statistic  for  dichotomies  or  rank-order 
correlation. 243pp  446-449  The  problems  with  the  percent  agreement  approach  are  a  lack  of  a 
standardized  range  for  interpretation  and  no  correction  for  chance  agreements.242  The 
correlation  approach  is  also  problematic:  it  only  indicates  the  degree  of  association  for 
paired  scores,  not  agreement.  The  covariance  of  paired  ratings  may  be  very  different  than 
actual  agreement  if  systematic  error  is  present.258  The  third  and  recommended  approach 
is  the  use  of  Cohen’s  Kappa  statistic:254'259  Kappa  is  interpreted  as  an  intraclass 
correlation  coefficient  and  represents  the  proportion  of  agreement  among  raters  after 
chance  agreement  has  been  removed.242  Kappa  is  expressed  symbolically  as:242 

K=  Po-Pc 
l  -  Pc 


Where  Po  equals  the  observed  proportion  of  agreement  and  Pc  is  the  proportion  of 
expected  agreement  based  on  chance;  chance  agreement  increases  as  the  variability  of 
observed  ratings  decreases.  Kappa  values  theoretically  range  from  -1  to  +1  but  extreme 
values  are  often  restricted  by  reduced  variability  of  the  data.242'254  Positive  Kappa  values 
are  interpreted  as  actual  agreement  beyond  that  expected  by  chance;  values 
approximating  zero  indicate  chance  agreement  and  values  less  than  zero  are  interpreted  as 
agreement  that  is  worse  than  that  expected  by  chance.242  Landis  and  Koch  have  proposed 
the  following  ranges  of  Kappa  coefficients  and  corresponding  strength  of  agreement 
associated  with  them:260 


Kappa  Statistic 


Strength  of  Agreement 


<0.00 

0.00-0.20 

0.21-0.40 

0.41-0.60 

0.61-0.80 

0.81-1.00 


Poor 

Slight 

Fair 

Moderate 
Substantial 
Almost  Perfect 
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Although  the  ranges  they  proposed  are  arbitrary,  they  have  found  acceptance  in  the 
measurement  literature242'254  and  allow  a  consistent  nomenclature  for  describing  the 
strength  of  agreement  associated  with  Kappa  statistics.260  Fleiss  has  simplified  this 
descriptive  scale  in  the  following  manner:  values  below  0.40  to  represent  poor 
agreement;  values  between  0.40  and  0.75  represent  fair  to  good  agreement  beyond 
chance,  and  values  greater  than  .075  represent  excellent  agreement  beyond  chance.242 
Some  extended  uses  of  Kappa  include:  allowance  for  more  than  two  raters,  different 
raters  for  each  subject,  and  allowance  for  missing  data.135 

Two  models  of  reliability  have  been  described  for  interval  and  ratio  scaled  data.247  Both 
models  reflect  the  ratio  of  the  variance  of  interest  over  the  sum  of  the  variance  of  interest 
plus  error  as  previously  described  and  produce  a  reliability  statistic  called  an  intraclass 
correlation  coefficient  (ICC),  which  is  designated  by  the  symbol  R.  The  ICC  provides  a 
meaningful  index  of  how  dependably  a  measure  maps  onto  or  is  correlated  with  the 
underlying  characteristic  being  assessed.254  An  ICC  is  directly  interpretable  as  a 
proportion  of  explained  variance  and  describes  the  ability  of  a  measure  to  differentiate 
between  subjects.255  The  first  model  of  reliability  considered  is  the  classic  or 
psychometric  theory  of  reliability  in  which  every  test  score  is  considered  to  be  composed 
of  two  parts:  true  score  and  error  score.  The  error  score  is  comprised  of  true  random 
error  and  error  from  other  sources.  The  psychometric  theory  of  reliability  treats  all 
sources  of  error  the  same  and  makes  no  distinction  between  them.  Alternatively,  the 
generalizability  theory  of  reliability  encompasses  a  second  model  of  reliability  that 
allows  the  error  score  to  be  partitioned  in  to  several  sources  of  variability  termed 
“facets”.254  A  random  effects  ANOVA  is  utilized  to  partition  the  total  variation  in  scores 
into  separate  components  corresponding  to  the  variables  in  the  design.  In  this  manner, 
error  sources  that  exert  a  systematic  influence  can  be  estimated  and  separated  from  their 
error  component.254  All  reliability  estimates  for  interval  and  ordinal  level  measures  in 
this  study  will  be  based  on  the  generalizability  theory  of  reliabilty.  Similar  to  Landis  and 
Koch  and  Fleiss,  Portney  and  Watkins  have  described  the  following  ranges  of  ICC  and 
the  strength  of  agreement  associated  with  them:  R  <.75=  Poor  to  Moderate;  R  .75  -  .90= 
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Good;  and  R  >.90=  Excellent.  For  most  clinical  measurements  R  should  exceed 

90  243p.514 

A  measure  that  is  highly  reliable  does  not  necessarily  mean  it  is  of  value  when  applied  to 
individual  patients.  In  addition  to  acceptable  reliability,  the  accuracy  or  precision  of  a 
measure  is  important.  Accuracy  or  precision  are  synonymous  terms  related  to  reliability 
and  refer  to  the  variability  of  one  person’s  score  over  measurement  occasions.254 
Precision  may  be  expressed  as  the  standard  error  of  measure  (SEM)  and  is  depicted 
symbolically  as  SEM  =  SDx  (l-R)!/2,  where  SDx  is  the  standard  deviation  of  the  measure 
of  interest  and  R  is  the  ICC,  or  reliability  coefficient,  of  the  measure  of  interest.254 
Estimates  of  both  precision  and  reliability  are  important.  Indeed,  low  reliability  may  be 
of  little  concern  if  the  index  of  variability  suggests  the  inconsistency  of  measurements 
occur  in  a  relatively  small  range.  Measurement  methods  should  provide  data  that  are 
both  sufficiently  reliable  and  precise.261 

Reliability  coefficients  have  been  reported  for  only  a  few  of  the  clinical  examination 
procedures  in  this  study.  Therefore,  estimates  of  reliability  were  obtained  for  all  clinical 
examination  procedures  assessed  by  this  study.  The  reliability  coefficients  for  all  clinical 
examination  measures,  with  the  exception  of  cervical  range-of-motion  and  wrist  ratio, 
were  reported  as  a  Kappa  statistic.  Kappa  was  also  reported  for  ordered  responses  such 
as  the  CCT,  Phalens  test,  the  ULTT’s,  and  selected  questions  of  history  in  addition  to 
Kappa  for  collapsed  categories  (i.e.  dichotomy).  The  qualitative  interpretation  for  Kappa 
described  by  Fleiss  et.  al.  was  used  in  this  study.  Measures  with  Kappa  values  above  .75 
were  considered  excellent  or  exceptionally  reliable;  those  with  Kappa  values  between 
0.40  and  0.75  were  considered  to  have  fair  to  good  reliability;  and  those  below  0.40  were 
considered  poor  and  unreliable.242  The  reliability  coefficient  for  cervical  range-of-motion 
and  the  wrist  ratio  was  reported  as  an  ICC  (2,1)  statistic  along  with  the  corresponding 
SEM  for  both  measures.  The  reliability  of  cervical  range-of-motion  and  wrist  ratio 
measurements  were  considered  excellent  and  well  suited  for  clinical  use  if  an  ICC  of  .90 
is  achieved.243p  514 
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Point  estimates  of  the  reliability  coefficient  were  used  to  decide  whether  to  accept  or 
reject  research  hypotheses  of  reliability.  However,  the  confidence  interval  method  was 
used  to  determine  whether  the  point  estimate  of  a  reliability  coefficient  was  a  definitive 
finding.  A  definitive  finding  is  considered  to  be  a  finding  or  value  which  would  within  a 
given  range  95%  of  the  time  with  repeated  sampling. 243pp  292 " 294  Nintey-five  percent  Cl’s 
were  computed  for  all  reliability  coefficients.  When  the  lower  bound  of  the  95CI  for  a 
given  variable  was  equal  to  or  above  the  upper  limit  of  the  hypothesized  level  of 
reliability  for  that  variable,  then  the  point  estimate  for  that  level  of  reliability  was 
considered  to  be  definitive.  When  the  upper  bound  of  the  95CI  for  a  given  variable  was 
equal  to  or  below  the  lower  limit  of  the  hypothesized  level  of  reliability  for  that  variable, 
then  the  point  estimate  for  that  level  of  reliability  was  also  considered  definitive. 


4.3.2  Hypothesis  #2  -  Diagnostic  Accuracy 

The  second  hypothesis  to  be  tested  relates  to  the  diagnostic  properties  of  the  clinical 
examination  items  and  patient  self-report  instruments.  The  test  items  and  condition  they 
are  intended  to  diagnose  or  evaluate  are  listed  on  the  following  page. 

Cervical  Radiculopathy  Carpal  Tunnel  Syndrome 


Questions 

1. -  “Most  bothersome  symptoms..” 

2. -  “Where  most  bothersome..” 

3. -  “Symptom  behavior..” 

6  -  “Entire  limb  numb..” 

7.-  “Symptoms  keep  from  sleep..” 
9..-  “Neck  movement  improves..” 

Neurologic  exam:  sensory  (dermatomes), 
motor,  and  reflexes 

Measures:  ROM  (flexion,  extension, 
sidebending,  and  rotation) 

Provocation  tests:  Spurling’s  A  &  B, 
Distraction,  Shoulder  abduction,  Valsalva, 
ULTT  A&B 

HSAM/Self-report  Instruments:  NDI, 
lOcmVAS,  FABQ 


Questions: 

1. -  “Most  bothersome  symptoms..” 

2. -  “Where  most  bothersome..” 

3. -  “Symptom  behavior..” 

4. -  ‘“Hand  fat/swollen..” 

5. -  ‘Tumbling/dropping..” 

6  -  “Entire  limb  numb..” 

8  -  ‘Night  symptoms  wake  .” 

10. -  “Hand  shaking  improves  .” 

1 1 . -  “Worse  with  hand  use.  ” 

Neurologic  exam:  sensory  and  (dermatomes  and 
median  distribution),  motor 

Measures:  Wrist  ratio 

Provocation  tests:  ULTT  A&B,  Tinel’s  A&B, 
Phalen’s,  CCT 


HSAM/Self-report  Instruments:  SSS  &  FSS, 
Hand  diagram,  10cm  VAS,  FABQ 
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The  diagnostic  properties  of  a  single  cluster  of  test  items  (test  item  cluster  (TIC)) 
identified  by  logistic  regression  as  being  the  best  predictor  of  each  respective  condition 
was  also  evaluated.  Additional  variables  of  age  and  duration  of  symptoms  were  included 
in  this  analysis 

Sensitivity  and  Specificity:  Two-by-two  contingency  tables  were  used  to  calculate  Sn 
and  Sp  for  each  test  item  relative  to  their  respective  condition,  either  CR  or  CTS,  for  the 
total  sample.  Ninety-five  percent  confidence  intervals  were  also  calculated  for  all  Sn  and 
Sp  values.  In  this  study,  the  presence  of  each  condition  as  determined  by  EMG/NCS 
findings  constituted  the  gold  standard  to  which  the  positive  and  negative  findings  of  each 
test  item  were  compared.  For  example,  all  patients  classified  as  CTS  were  considered  to 
have  the  condition  present  and  the  remainder  of  patients  the  condition  was  considered 
absent.  Because  diagnostic  tests  may  have  different  sensitivities  or  specificities  in 
different  parts  of  the  clinical  spectrum  of  the  disease  they  purport  to  identify  or  exclude,50 
Sn  and  Sp  were  also  be  calculated  for  subgroups  of  patients  in  both  conditions  based  on 
severity  of  EMG/NCS  findings.  Patients  classified  as  mild  or  moderate  CR  or  CTS 
formed  one  subgroup  within  each  respective  condition.  Patients  classified  with 
pronounced  or  severe  CTS  and  patients  classified  as  severe  CR  formed  the  other 
subgroup.  Subgroup  calculations  were  only  be  performed  if  the  prevalence  of  condition 
was  10%  or  greater  for  a  given  subgroup. 24pp  90-91  Sensitivity  and  specificity  calculations 
have  been  previously  described  in  section  2.2.3. 

To  avoid  confounding,  all  patients  determined  to  have  both  CR  and  CTS  were  excluded 
from  the  diagnostic  accuracy  analysis  and  were  reported  as  a  percentage  or  frequency 
statistic  of  the  total  sample.  The  concomitant  presence  of  both  conditions  has  been 
described  in  the  literature  and  is  often  referred  to  as  the  “double-crush” 
phenomenon.18  20'263'264  Based  on  previous  reports,  the  percentage  of  patients  in  this 
study  expected  to  have  CR  and  CTS  concomitantly  is  approximately  3-5%. 20-263 

Receiver  Operating  Characteristic  fROC)  Curve:  In  order  to  identify  the  most 
appropriate  cut-off  value  for  continuous  or  multi-level  response  variables,  the  Sn  and  Sp 
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values  for  each  level  of  response  were  plotted  as  a  ROC  curve.  The  ROC  curve  plots 
sensitivity  (true  positive  ratio)  against  1  -  specificity  (false  positive  ratio)  for  the  criterion 
defining  a  positive  test.265  An  ROC  curve  is  simply  a  graph  of  the  pairs  of  true  positive 
rates  and  false  positive  rates  that  correspond  to  each  possible  cutoff  value  for  the 
diagnostic  test  result.24pp  117  The  area  may  range  from  .5  (no  diagnostic  ability)  to  1.0 
(perfect  diagnostic  ability)  as  the  ROC  curve  moves  towards  the  top  left-hand  comer  of 
the  graph;266  the  area  under  a  ROC  curve  represents  the  diagnostic  ability  of  the  test. 
Receiver  operating  characteristic  curves  were  constructed  for  the  following  variables: 
Cervical  range-of-motion,  wrist  ratio,  and  all  patient  self-report  measures.  The  value 
closest  to  the  upper  left-hand  comer  of  the  graph  minimizes  the  occurrence  of  false 
positive  and  false  negatives  when  the  prevalence  rate  is  around  50%.24pp'118 

Likelihood  ratios  (LR+  &  LR-)  were  calculated  for  each  test  item  in  the  manner 
previously  described  in  section  2.2.3  along  with  their  associated  95CI. 

TIC  Cluster:  Clinical  diagnosis  rarely  resides  in  a  single  finding,  but  more  often  in  the 
pattern  of  findings.27  Therefore,  using  a  combination  of  tests  as  a  single  TIC  may 
increase  the  diagnostic  value  of  the  tests. 16,267p  41  Because  each  component  of  the  clinical 
examination  can  be  considered  a  separate  test,  one  must  choose  how  to  incorporate  the 
numerous  results.23  One  method  of  combining  various  items  of  the  clinical  examination 
is  the  use  of  LR’s  to  sequentially  modify  posterior  probability  of  the  presence  or  absence 
of  a  target  disorder  and  was  illustrated  in  the  example  in  section  2.2.3.  However,  this 
serial  multiplication  of  LR’s  assumes  that  the  tests  are  conditionally  independent.241^ 333 
Conditional  independence  means  the  result  of  one  test  is  not  affected  by  the  outcome  of 
any  of  the  other  tests  performed.23  If  the  assumption  of  test  independence  is  violated, 
diagnostic  accuracy  can  be  degraded  and  result  in  inaccurate  assessments.268’24pp  136-1 39 

The  method  described  by  Holleman  and  Simel  was  used  to  identify  the  most  accurate 
TIC’s:  one  TIC  for  the  diagnosis  of  CR  and  another  for  the  diagnosis  CTS.  To  reduce  the 
number  of  variables,  LR’s  for  test  items  with  a  95%CI  that  included  .60  to  1 .4  were  be 
excluded  from  further  consideration;  LR  values  of  one  or  close  to  one  are  indeterminate 
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and  therefore  are  not  considered  useful.23  Remaining  variables  were  then  entered  using  a 
backward  stepwise  procedure  into  a  binary  logistic  regression  model  (condition  present 
or  absent).  Variables  selected  by  the  regression  model  as  most  predictive  of  the  condition 
of  interest  were  combined  or  clustered  into  a  TIC  and  treated  as  a  single  test  item.  The 
sensitivity,  specificity,  and  LR’s  for  the  TIC  was  computed  as  previously  described. 
Although  the  variables  identified  by  this  method  may  still  be  interdependent  to  some 
degree,  Holleman  and  Simel  reported  no  difference  in  prediction  ability  between  this 
method  and  a  more  complex  procedure  that  forced  conformity  with  the  independence 
assumption.23 

In  this  study,  point  estimates  and  95%  Cl’s  for  the  Sn,  Sp,  likelihood  ratios  (LR+,  LR-), 
were  computed  for  each  clinical  examination  item,  patient  self  report  measure,  and  TIC. 

The  diagnostic  accuracy  values  of  individual  clinical  examination  variables  were 
considered  acceptable  for  their  respective  condition  when  any  of  the  following  occured: 

1 .  Either  Sn  or  Sp  is  equal  to  or  greater  than  .70;  2.  A  LR+  equal  or  greater  than  2.0;  and 
3.  A  LR-  equal  to  or  less  than  .50.  When  test  Sn  and  Sp  values  both  equal  or  exceed  .70, 
LR+  and  LR-  values  will  exceed  2.3  and  be  below  .43,  respectively.  Based  on  the 
estimated  prevalence  or  pretest  probability  of  CR  and  CTS  in  this  sample,  LR+  values 
>2.0  and  LR-  values  <.5  will  result  in  posttest  probability  changes  of  at  least  15%. 

The  guidelines  listed  in  the  preceding  paragraph  were  used  to  accept  or  reject  the 
previously  specified  hypothesis  of  diagnostic  accuracy  for  an  individual  clinical 
examination  variable  as  well  as  determine  whether  a  diagnostic  accuracy  value  is 
considered  defninitve. 

4.3.3  Hypothesis  #3  -  Test  Predictive  Validity 

The  third  hypothesis  to  be  tested  relates  to  the  predictive  validity  of  the  clinical 
examination  items  and  patient  self-report  instruments.  In  addition,  the  FABQ  score  and 
the  following  EMG/NCV  variables  were  also  included  as  predictor  variables:  sensory  and 
motor  nerve  conduction  latency;  sensory  and  motor  amplitudes;  and  presence  of 
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spontaneous  activity  in  the  abductor  pollicus  brevis.  The  diagnostic  properties  of  a  single 
TIC  identified  by  logistic  regression  as  being  the  best  predictor  of  patient  outcome  was 
also  evaluated. 

Although  current  gold  standards  tend  to  be  defined  in  terms  of  pathologic  anatomy, 
clinical  course  and  prognosis  of  patients  are  increasingly  used  as  gold  standards. 24pp  81 
Predictive  validity  for  each  test  item  and  a  single  TIC  were  reported  as  Sn,  Sp,  and  LR’s. 
In  this  study,  the  following  analyses  were  performed  for  both  the  CR  group  and  the  CTS 
group  to  establish  the  predictive  validity  of  the  pertinent  test  items. 

For  the  first  analysis,  type  of  intervention,  defined  as  surgical  or  non-surgical,  served  as 
the  gold  standard.  Patients  who  received  surgery  were  considered  positive  and  those 
treated  non-surgically  were  considered  negative.  If  the  prevalence  of  surgery  for  either 
condition  is  less  than  10%,  an  analysis  of  predictive  validity  using  type  of  intervention 
was  not  be  performed  for  that  condition.  The  ability  of  a  test  to  produce  a  meaningful 
change  in  posttest  probability  for  a  condition  is  severely  diminished  when  prevalence  of 
the  condition  is  at  either  extreme. 24pp'90'91 

For  the  second  analysis,  patient  outcome  defined  by  patient  improvement  using  a  GRCS 
was  the  gold  standard.  The  criteria  recommended  by  Jaeschke  et.  al.  was  used  to 
determine  subject  improvement:  subjects  scoring  between  -5  and  +3  (“somewhat  worse” 
and  “somewhat  better”,  respectively)  were  considered  unimproved  (stable,  no  meaningful 
change  in  condition)  or  to  have  deteriorated.  Subjects  scoring  greater  than  +3 
(“moderately  better”  to  “a  very  great  deal  better”)  were  considered  improved  or  to  have 
undergone  clinically  meaningful  change.  Patients  who  were  unimproved  or  worsened 
were  considered  negative  and  those  who  are  improved  were  considered  positive.  Two 
separate  analyses  of  patient  improvement  were  performed,  one  for  non-surgically  treated 
subjects  and  another  or  surgically  treated  subjects.  The  patient  improvement  analysis  of 
surgically  treated  subjects  was  not  be  performed  if  surgery  prevalence  for  either  condition 
was  less  than  25  subjects  or  less  than  10%. 
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For  the  third  analysis,  patient  outcome  defined  by  worsened  patient  condition  using  a 
GRCS  served  as  the  gold  standard.  The  criteria  recommended  by  Jaeschke  et.  al.  was 
used  to  determine  subject  improvement:  subjects  scoring  between  -5  and  +3  (“somewhat 
worse”  and  “somewhat  better”,  respectively)  were  considered  unimproved  (stable,  no 
meaningful  change  in  condition).  Subjects  scoring  less  than  -5  (“moderately  worse”  to  “a 
very  great  deal  worse”)  were  considered  worsened  or  to  have  undergone  clinically 
meaningful  change.  Patients  who  are  worsened  were  considered  positive  and  those  who 
are  unchanged  or  improved  were  considered  negative.  Two  separate  analyses  of 
worsened  patient  condition  were  performed,  one  for  non- surgically  treated  subjects  and 
another  or  surgically  treated  subjects.  The  patient  improvement  analysis  of  surgically 
treated  subjects  was  be  performed  if  surgery  prevalence  for  either  condition  was  less  than 
25  subjects  or  less  than  10%. 

Two-by-two  contingency  tables  for  Sn  and  Sp,  ROC  curves.  Likelihood  ratios  (LR+; 
LR-),  and  identification  of  a  single  test  item  cluster  were  computed  in  the  manner 
previously  described  for  hypothesis  #2.  The  same  criteria  used  for  hypothesis  #2  was 
used  to  accept  or  reject  the  previously  specified  hypothesis  of  predivitive  validity  for  an 
individual  clinical  examination  variable  or  the  TIC  and  to  determine  which  variables 
were  to  be  considered  definitive. 

All  statistical  test  procedures  were  computed  using  Microsoft  Excel  and  the  SPSS 
statistical  software  packages. 

5.0  SAMPLE  SIZE  ESTIMATION 

Because  this  study  is  descriptive  in  nature,  a  sample  size  estimate  derived  from  power 
calculations  based  on  group  differences  is  not  possible.  Instead,  sample  size  was  based 
on  the  ability  of  this  study  to  detect  the  following:  1.  An  ICC  of  .90  that  is  significantly 
greater  than  .75  at  an  alpha  level  of  .05  and  beta  level  of  .20  for  a  one-tailed  test  (i.e. 
power  is  greater  than  ,80);243p  514  2.  A  Kappa  coefficient  of  .60  that  is  significantly 
greater  than  .40  at  an  alpha  level  of  .05  and  beta  level  of  .20  for  a  one-tailed  test  with  a 
base  chance-agreement  rate  of  .50  (i.e.  power  is  greater  than  .80), 242  and  3.  A  test 
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sensitivity  or  specificity  of  .80  whose  95%CI  has  a  minimum  lower  bound  that  exceeds 
.68.  Forty  subjects  (20  of  each  condition)  from  each  of  the  following  facilities  will  be 
required  for  this  study  (160  total  subjects):  The  National  Naval  Medical  Center 
(NNMC),  Bethesda,  MD;  Wilford  Hall  USAF  Medical  Center  (WHMC),  San  Antonio, 
TX;  and  the  Air  Force  Academy  Hospital  (AFAH),  Colorado  Springs,  CO  The  procedure 
described  by  Kraemer  and  Thiemann  and  implemented  by  the  EX-SAMPLE  statistical 
computer  package  indicates  that  this  sample  size  is  more  than  adequate  to  establish  the 
specified  reliability  coefficients  for  each  facility.269 

Based  on  the  estimated  prevalence  rates  for  each  condition  in  this  study,  a  sample  size  of 
160  is  the  minimum  for  which  the  lower  bound  of  the  95%  confidence  interval  for  a  true 
sensitivity  or  specificity  of  .80  would  exceed  .68. 55 

6.0  RESULTS 

6.1  Study  Sample  and  Diagnostic  classification 

A  total  of  8 1  patients  from  the  following  three  participating  medical  centers  met 
eligibility  criteria  and  were  enrolled  in  the  study:  Wilford  Hall  Medical  Center  (n=  68), 
EMG  laboratories  of  both  Montefiore  and  Presbyterian  University  Hospital  (n=  11),  and 
Brooke  Army  Medical  Center  (BAMC)  (n=2).  Two  of  the  original  participating  centers, 
the  National  Naval  Medical  Center  and  the  United  States  Air  Force  Academy  Hospital, 
eliminated  themselves  during  the  course  of  the  study  due  to  subject  enrollment 
difficulties.  One  additional  facility,  BAMC,  participated  in  subject  enrollment  after  the 
study  commenced.  Due  to  limited  subject  enrollment  at  all  facilities,  the  original  study 
entry  criteria  for  duration  of  symptoms  was  eliminated  and  duration  of  symptoms  was 
recorded  for  all  subjects.  The  Institutional  Review  Board  of  all  participating  facilities 
approved  all  changes  to  the  study  protocol.  All  consecutive  patients  referred  to  the  EMG 
lab  with  suspected  CR,  CTS,  or  with  other  suspected  diagnosis  but  had  symptoms 
compatible  with  CR  or  CTS  were  informed  about  the  study  by  EMG  lab  personnel. 
Interested  subjects  were  asked  to  fill  out  a  screening  form  to  determine  eligibility 
(appendix  A).  Interested  and  eligible  subjects  were  given  further  information  about  the 
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study,  then  read  and  signed  an  informed  consent  document  approved  by  the  respective 
facility  Institutional  Review  Board. 

The  frequency  of  conditions  suspected  by  the  referring  provider  is  compared  with  the 
conditions  suspected  by  the  EMG/NCS  provider  in  Table  6;  the  EMG/NCS  provider 
suspected  condition  was  not  available  for  5  subjects.  Although  the  same  number  of 
subjects  were  suspected  by  the  referring  providers  to  have  CR,  there  was  not  always 
concordance  between  the  two  providers.  The  referring  provider  suspected  CTS  in  three 
subjects  diagnosed  with  CR  while  the  EMG/NCS  provider  suspected  CR,  normal,  and 
both  conditions  for  these  same  individuals.  None  of  the  subjects  who  participated  in  this 
study  were  receiving  workman's  compensation  or  had  pending  litigation  for  their 
condition. 


Table  6.  Condition  suspected  by  providers 


Referring  Provider  Electromyography 

Provider 

Condition 

Frequency 

Radiculopathy 

29 

29 

CTS 

42 

31 

Both 

5 

7 

Other 

5 

9 

Total  Available 

81 

76 

Seven  different  EMG/NCS  providers  performed  the  nerve  conduction  studies,  needle 
electromyography  procedures,  and  subsequent  diagnostic  classification  of  subjects.  At 
one  center,  three  different  evoked  potential  technicians  performed  nerve  conduction 
procedures  only.  The  qualifications  of  the  EMG/NCS  providers  and  evoked  potential 
technicians  are  listed  in  Table  7.  All  EMG/NCS  diagnostic  classifications  made  by  non¬ 
physician  EMG/NCS  providers  were  reviewed  and  approved  by  the  supervising 
EMG/NCS  lab  physician  who  is  board  certified  by  the  American  Academy  of 
Electrodiagnostic  Medicine  (AAEM).  There  were  no  disagreements  between  the  non¬ 
physician  EMG/NCS  providers  and  supervising  EMG/NCS  lab  physicians  regarding 
diagnostic  classification  of  subjects. 
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Table  7.  Qualifications  of  personnel  performing  electrophysioiogic  testing 


Facility  and  Provider 

Number,  and  Role 

Years  EMG/NCS 

Experience 

Board 

Certification 

Number  of  studies 

performed 

University  of  Pittsburgh 

1.  EMG 

5 

Yes:  AAEM 

4 

2.  EMG 

7 

Yes:  AAEM 

2 

3.  EMG 

2 

Yes:  AAEM 

3 

4.  EMG 

6 

Yes:  AAEM 

1 

5.  EMG 

4 

Yes:  AAEM 

1 

6.  NCS. 

15 

Yes:  AAEM 

6 

7.  NCS 

5 

Yes:  AAEM 

4 

8.  NCS 

2 

Yes:  AAEM 

'  1 

Wilford  Hall  Medical 

Center 

1.  EMG/NCS 

10 

Yes:  ABPTS  ECS 

49 

2.  EMG/NCS 

17 

Yes:  ABPTS  ECS 

19 

Brooke  Army  Medical 

Cetner 

1.  EMG/NCS 

17 

Yes:  ABPTS  ECS 

2 

AAEM:  American  Academy  of  Electrodiagnostic  Medicine 

ABPTS  ECS:  American  Board  of  Physical  Therapy  Specialties,  Electrophysioiogic  Certified 
Specialist 


Forty-one  females  (mean  age=  44.9yrs,  sd=  12.5  range=  24  -70)  and  40  males  (mean 
age=  45.0yrs,  sd=l  1 .4,  range=  21  -  68)  participated  in  this  study.  Once  enrolled  in  the 
study,  subjects  completed  all  self-report  instruments  and  received  a  standardized 
EMG/NCS  examination.  Following  the  standardized  EMG/NCS  examination,  subjects 
were  assigned  to  the  following  diagnostic  categories  based  on  the  results  of  the 
EMG/NCS  examination  and  the  assessment/impression  of  the  EMG/NCS  provider:  1 . 
Normal  (n=  31),  2.  Unilateral  CTS  (n=  16),  3.  Bilateral  CTS  (n=  15),  4.  CTS  with  ulnar 
neuropathy  (n=  1),  5.  Cervical  radiculopathy  (n=  13),  6.  Cervical  radiculopathy  with 
CTS  (n=  3),  7.  Other  (n=  2). 


The  subjects  age,  duration  of  symptoms,  and  several  nerve  conduction  study  parameters 
of  the  median  nerve  are  compared  in  Tables  8  and  9  by  diagnostic  category  and  gender. 
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Table  8.  Descriptive  statistics  of  subjects  age  and  duration  of  symptoms. 


EMG/NCS 
based  Dx 

Gender 

N 

Variable 

Age=  years 
Symptoms=  days 

Mean  Minimum  Maximum 
Median 

Sd  Dev. 

Normal 

Female 

17 

Age 

39.29 

24.00 

61.00 

12.14 

Symptoms 

123.5 

31.00 

5415.00 

Male 

14 

Age 

38.78 

21.00 

68.00 

10.94 

Symptoms 

184.5 

21.00 

7220.00 

Unilateral  CTS 

Female 

10 

Age 

51.90 

31.00 

70.00 

12.97 

Symptoms 

352 

21.00 

1460.00 

Male 

6 

Age 

38.00 

28.00 

49.00 

7.89 

Symptoms 

365 

56.00 

1277.00 

Bilateral  CTS 

Female 

9 

Age 

44.77 

28.00 

61.00 

11.53 

Symptoms 

250 

31.00 

5475.00 

Male 

6 

Age 

47.16 

36.00 

60.00 

10.00 

Symptoms 

61 

21.00 

365.00 

CTS  w /  Ulnar 

Female 

1 

Age 

43.00 

43.00 

43.00 

neuropathy 

Symptoms 

30.00 

30.00 

30.00 

Radiculopathy 

Female 

2 

Age 

56.50 

55.00 

58.00 

2.12 

Symptoms 

42.00 

42.00 

42.00 

Male 

11 

Age 

50.90 

39.00 

61.00 

7.68 

Symptoms 

69.5 

42.00 

1095.00 

Radiculopathy 

Female 

1 

Age 

52.00 

52.00 

52.00 

w/CTS 

j 

Symptoms 

Male 

2 

Age 

62.00 

60.00 

64.00 

2.82 

Symptoms 

31.50 

21.00 

42.00 

Other 

Female 

1 

Age 

46.00 

46.00 

46.00 

Symptoms 

87.00 

87.00 

87.00 

Male 

1 

Age 

62.00 

62.00 

62.00 

Symptoms 

551.00 

551.00 

551.00 
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Table  9.  Descriptive  statistics  of  subjects  median  nerve  conduction  study  test  results. 


EMG/NCS  Median  Nerve 
based  Dx  Parameters* 


Normal 


Involved  palmar  31 
latency 

Involved  motor  31 
latency 

Involved  palmar  31 
amplitude 
Involved  motor  31 
amplitude 

Distal-proximal  ratio  27 


Mean 

Minimum 

Maximum 

Deviation 

1.89 

1.60 

2.20 

.15 

3.62 

2.90 

4.30 

.41 

81.09 

40.00 

140.00 

25.80 

10229.35 

5000.00 

18650.00 

3139.99 

.78 

.60 

.90 

.10 

Unilateral  CTS 


Involved  palmar  16 
latency 

Involved  motor  16 
latency 

Involved  palmar  16 
amplitude 
Involved  motor  16 
amplitude 

Distal-proximal  ratio  12 


3.28 

4.95 

62.62 

7106.25 

.99 


1.80 

10.00 

2.65 

3.30 

14.90 

2.78 

.00 

183.00 

46.34 

100.00 

10770.00 

2937.15 

.60 

1.50 

.23 

Bilateral  CTS 


Involved  palmar  15 
latency 

Involved  motor  15 
latency 

Involved  palmar  15 
amplitude 
Involved  motor  15 
amplitude 

Distal-proximal  ratio  11 


3.68 

2.30 

10.00 

2.59 

4.79 

4.00 

6.40 

.77 

46.53 

.00 

114.00 

33.63 

8446.00 

4900.00 

13300.00 

2522.23 

1.21 

1.00 

1.60 

.21 

CTS  w /  Ulnar  Involved  palmar  1 

neuropathy  latency 

Involved  motor  1 
latency 

Involved  palmar  1 
amplitude 
Involved  motor  1 
amplitude 

Distal-proximal  ratio  1 


2.30 
3.20 

42.00 

10000.00 

1.30 


1.30  1.30 


Radiculopathy 


Involved  palmar 
latency 
Involved  motor 
latency 
Involved  palmar 
amplitude 
Involved  motor 
amplitude 
Distal-proximal  ratio 


13  1.99 

13  3.73 

13  72.92 

13  10495.38 
13  .76 


1.80 

2.20 

.16 

3.20 

4.20 

.35 

40.00 

160.00 

41.46 

5780.00 

18240.00 

3654.60 

.60 

.90 

.10 
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Table  9  (cont’d) 


Radiculopathy 

Involved  palmar 

3 

3.03 

2.80 

3.20 

.2082 

w/CTS 

latency 
Involved  motor 

3 

4.46 

4.40 

4.50 

5.77 

latency 
Involved  palmar 

3 

43.66 

38.00 

52.00 

7.3711 

amplitude 
Involved  motor 

3  10500.00 

8490.00 

12000.00 

1809.7237 

Other 

amplitude 
Distal-proximal  ratio 

Involved  palmar 

1 

2 

1.30 

6.25 

2.50 

10.00 

.01 

latency 
Involved  motor 

2 

6.55 

4.80 

8.30 

2.47 

latency 
Involved  palmar 

2 

18.00 

.00 

36.00 

25.45 

amplitude 
Involved  motor 

2 

7230.00 

4720.00 

9740.00 

3549.67 

amplitude 
Distal-proximal  ratio 

2 

.70 

.00 

1.40 

.98 

•Latency  in  milliseconds  and  amplitudes  in  microvolts. 


Thirteen  subjects  (16%)  were  classified  with  CR,  the  left  extremity  was  involved  in  nine 
subjects  and  the  right  in  three.  These  CR  subjects  were  suspected  by  the  EMG  provider 
to  have  the  following  conditions  prior  to  the  EMG/NCS  examination:  CR=  10,  CTS-  1, 
both  conditions^  1,  other=  1.  The  following  conditions  were  suspected  for  these  same  13 
subjects  by  the  provider  who  referred  the  patient  to  the  EMG  lab:  CR=  9,  CTS=  4.  The 
13  subjects  diagnosed  with  CR  and  31  subjects  diagnosed  with  CTS  were  further 
subclassified  based  on  the  severity  of  EMG/NCS  findings.  For  the  13  CR  subjects,  nine 
were  subclassified  as  mild  and  four  as  moderate.  The  frequency  of  needle  EMG  findings 
for  muscles  tested  in  the  standardized  exam  are  listed  in  Table  10.  Needle  EMG  testing 
of  muscles  other  than  those  specified  in  the  standardized  EMG/NCS  exam  was  permitted 
when  thought  indicated  by  the  EMG/NCS  provider.  The  additional  muscles  sampled, 
along  with  frequency  and  findings,  and  are  listed  in  Table  11.  In  only  one  instance  did 
additional  muscle  testing  yield  abnormal  findings  (brachioradialis)  when  the  results  from 
the  previously  tested  standardized  muscles  was  normal.  Two  subjects  were  unable  to 
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tolerate  EMG  sampling  of  the  middle  cervical  paraspinal  muscles  and  one  subject  was 
unable  to  tolerate  EMG  sampling  of  the  lower  cervical  paraspinal  muscles.  Because  the 
flexor  carpi  radial, s  H-reflex  was  technrcally  unobtainable  in  42%  of  the  subjects,  „  was 
eliminated  as  part  of  the  diagnostic  criteria  for  CR.  The  diagnosttc  report  for  all  13 
subjects  indicated  involvement  of  the  C6  or  C7  root,  possible  involvement  of  the  C8  root 
in  1  subject,  and  C5  root  in  two  subjects. 

Thirty-one  subjects  were  classified  as  CTS,  the  left  extremity  was  involved  in  1 1  subjects 
and  the  right  in  20.  These  CTS  subjects  were  suspected  by  the  EMG  provtder  to  have  the 
following  conditions  prior  to  the  EMG/NCS  examination:  CTS*  19,  CR*  4,  both*  a,  and 
other=3;  ratings  for  two  subjects  were  missing.  The  following  conditions  were  suspected 
for  these  same  3 1  subjects  by  the  referring  provider:  CTS*  20,  CR*  6,  both*  3,  and 
other=2  Of  the  3 1  CTS  subjects,  14  were  classified  as  mild,  7  as  moderate,  9  as 
pronounced,  and  1  as  severe.  The  subclassificatton  of  CTS  subjects  is  presented  in 
Tables  12  and  13  along  with  descriptive  statistics  for  several  electrophystologic 
parameters,  age,  and  duration  of  symptoms.  Needle  EMG  findings  in  the  adbuc.or 
pollicus  brevis  are  listed  by  subclassification  in  Table  14.  There  were  five  CTS  subjects 
whose  only  abnormal  NCS  finding  was  a  >  2ms  median/ulnar  palmar  latency  difference, 
and  four  subjects  whose  only  abnormal  NCS  finding  was  a  distal-proximal  ratio  >1.0. 

The  diagnosis  in  the  remainder  of  these  subjects  was  based  on  a  prolonged  median 
palmar  latency  and/or  other  concomitant  abnormal  median  nerve  conduction  study 
parameters.  All  subjects  diagnosed  with  bilateral  CTS  had  symptoms  that  were 
predominate  in  one  hand.  The  hand  with  predominate  symptoms  was  considered  to  be 
the  involved  limb  for  the  purposes  of  this  study  and  was  used  for  subclassificat.on  and 

subsequent  clinical  testing. 

Following  the  standardized  EMG/NCS  examination  and  approximately  a  15  to  jO  minute 
rest  period,  a  standardized  clinical  examination  was  performed  by  a  physical  therapist 
(Rater  1)  and  repeated  by  a  second  physical  therapist  (Rater  2)  following  a  five 
minute  rest  period.  Both  raters  were  blinded  to  the  subjects  suspected  diagnosis, 
EMG/NCS  test  results,  and  diagnostic  classification.  Nine  different  physical  therapists 
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performed  the  standardized  clinical  examination  procedures.  Distinct  rater  pairs  were 
formed  although  substitute  of  raters  did  occur.  The  qualifications  of  the  phystca 
therapist  raters  are  listed  in  Table  15. 


Table  10  Frequency  of  needle  electromyography  findings  in  standard.zed  examination 
muscles. 

1+  2+  3+  4+ 

Fibs/PSW  Fibs/PSW  Fibs/PSW  Fibs/PSW 

1  " 

2 
1 


Muscle  Tested 

Silent  at 
Rest 

Deltoid 

11 

Triceps  brachii 

11 

Extensor  carpi  radialis 

12 

longus/brevis 

Flexor  carpi  radialus 

|  10 

Abductor  pollicus  brevis 

13 

First  dorsal  interosseus 

13 

Middle  cervical 

6 

paravertebrals  muscles 

Lower  cervical 

6 

paravertebrals 

Fibs/PSW-  Fibrillation  potentials  and  positive  sharp  waves 
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Table  11  Frequency  of  needle  electromyography  findings  in 
to  the  standard  examination  muscles. 


muscles  sampled  in  addition 


Muscle  Tested 

Pronator  Teres 
Biceps  Brachii 

Supraspinatus 
Infraspinatus 

Flexor  carpi  ulnaris 
Extensor  indicis  proprius 
Flexor  digitorum 
profundus  (slips  3  &  4) 

Brachioradialis 


Silent  at 
Rest 

T 
6 
4 
4 
3 
1 
1 


3+ 


4+ 


1+  2+ 

Fibs/PSW  Fibs/PSW  Fibs/PSW 


Fibs/PSW 


2 

1 

1 


Fibs/PSW-  Fibrillation  potentials  and  positive  sharp  waves 
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Table  12  Descriptive  statistics  of  median  nerve  conduction  study  test  results  by  c  p 
tunnel  syndrome  subclassification  category 


CTS 

subclassification 
Mildl 


N 


Median  Nerve 
Parameters* 

Involved  palmar  14 
latency 

Involved  motor  14 
latency 

Involved  palmar  14 
amplitude 
Involved  motor  14 


Mean  Minimum  Maximum  Qeviation^ 
2.24' 


5800.00  12600.00 


Moderate 


Pronounced 


amplitude 

Distal-proximal  ratio  14 

.95 

.60 

1.20 

.15 

Involved  palmar 

7 

2.75 

2.10 

3.40 

.51 

latency 
Involved  motor 

7 

4.75 

4.10 

5.40 

.41 

latency 
Involved  palmar 

7 

63.00 

39.00 

114.00 

28.70 

amplitude 
Involved  motor 

7 

8044.28 

4900.00 

13300.00 

2991.82 

amplitude 
Distal-proximal  ratio 

5 

1.22 

1.20 

1.30 

.004 

Involved  palmar 

9 

5.23 

2.40 

10.00 

3.59 

latency 
Involved  motor 

9 

5.36 

4.40 

6.70 

.89 

latency 
Involved  palmar 

9 

23.22 

.00 

52.00 

22.21 

amplitude 
Involved  motor 

9 

7345.55 

1900.00 

10290.00 

2642.84 

amplitude 
Distal-proximal  ratio 

4 

1.47 

1.20 

1.60 

.18 

Severe|  Involved  palmar  1 
latency 

Involved  motor  1 
latency 

Involved  palmar  1 
amplitude 
Involved  motor  1 
amplitude 

Distal-proximal  ratio  0 


absent 


- 

14.90 

14.90 

.00 

.00 

.00 

100.00 

100.00 

100.00 

•Latency  in  milliseconds  and  amplitudes  in  microvolts. 
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Table  13  Descriptive  statistics  of  subjects  age  and  symptom  duration  by  subclassifcation 


Carpal  Tunnel 
subclassificatio 

n 

Variable 

Age=  years  N 

Svmptoms=  days 

Mean 

- Minimum  Maximum  Std.  Deviation 

Median 

Mild 

Age  14 

47.5714 

31.00 

70.00 

Symptom  Duration 

154.5 

154 

5475 

Moderate 

Age  7 

35.7143 

28.00 

46.00  6.2906 

Symptom  Duration 

184 

184 

2555 

Pronounced 

Age  9 

53.6667 

42.00 

68.00  9.0830 

Symptom  Duration 

155 

21 

1460 

Severe 

Age  1 

34.0000 

34.00 

34.00 

Symptom  Duration 

Table  14  Frequency  of  spontaneous  activity  in  the  abductor  poilicus  brevis  of  subjects 
with  CTS 


CTS 

subclassification 

Spontanteous 

Activity 

Frequency 

Mild 

Silent  at  rest 

14 

Moderate 

Silent  at  rest 

6 

1+  Fibs/PSW 

1 

Pronounced 

Silent  at  rest 

5 

1+  Fibs/PSW 

3 

2+  Fibs/PSW 

1 

Severe 

4+  Fibs/PSW 

1 
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Table  15  Qualifications  of  physical  therapist  raters 


Center  number  and 
Therapist  Number 


Years  of 
Clinical 
Experience 


University  of 
Pittsburgh  (1) 

1 

2 

3 

Brooke  Army 
Medical  Center  (2) 

1 

2 

Wilford  Hall 
Medical  Center  (3) 
1 


7 

15 

9 


15 


24 


15 


Board 

Certification 


Rater  Pairs 
Number  First  and  Number 
Examinations  of  Paired 
performed  Examinations 
Performed 


No 

Yes:  ABPTS  OCS 
Yes:  ABPTS  OCS, 
SCS 


Yes:  OCS 
Rater  3,  facility  1 


Yes:  ABPTS  OCS 


Yes:  ABPTS  OCS 


4 

7 


56 


1  &3=5 
1  &  5=  1 


1  &2=  3 
1  &  3=  50 

1  &4=  1 

2  &  3=  1 

3  &  4=  4 


3 

4 


13 

25 


Yes:  ABPTS  OCS 
Yes:  ABPTS  OCS, 
SCS, 


6 

5 


ABPTS  OCS:  American  Board  o,  Physics,  Therapy  Species,  Orthopaedic  Certified 
TbPTS  SCS:  American  Board  of  Physica,  Therapy  Specie, ties,  Sports  Certified  Specs  ,s 


R I!  w“:T Medical  Center  performed  50  exammat ions  together. 
”  ol  rater  pan  ,  &  3  were  used  to  compute  -  reiiahiiity  statistics  due  to  the 
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limited  number  of  examinations  performed  by  other  specified  rater  pairs.  The  J 1  subjects 
not  included  in  the  reliability  analysis  did  no.  differ  from  the  other  50  subjects  with 
regard  to  age,  NDI,  SSS,  FSS,  or  pain  ratings  (all  p  values  >.05). 

The  reliability  statistics  for  clinical  examination  variables  are  shown  in  Table  1 6  and 
include  SEM  values  of  continuous  measures.  The  stem-and-leaf  plots  m  Figur 
show  the  distribution  of  scores  for  continous  measures.  Due  to  low  obseived  base-rates 
in  uninvolved  subjects,  only  tests  results  of  the  involved  limb  of  were  used  to  compute 
reliability  statistics  for  the  neurologic  clinical  examination  variables  Due  to 
asymmetric  contingency  tables,  reliability  statistics  were  only  computed  using  collapse 
categories  (3  levels)  and  dichotomized  results  for  ULTT  A,  ULTT  B,  and  Phalen^s 
Due  to  the  low  obseived  base-rates  of  increased  sensation  (rater  1  n=4,  rater  3  m=  5)  an 
hyper-reflexia  (rater  1  n=2,  rater  3  n-  0),  results  for  dermatomes,  median  nerve  fields, 
and  MSR’s  were  dichotomized  into  normal  (normal  or  hyper)  or  abnormal  (reduce  ) 
findings  There  were  no  manual  muscle  test  (MMT)  scores  of  zero.  Rater  1  identified  a 
subjects  with  P-  to  F  ratings  and  the  remainder  of  subjects  received  ratings  ,n  the  other 
two  categories.  Therefore,  reliability  statistics  were  computed  for  MMT  scores  that  were 
dichotomized  as  normal  and  abnormal.  No  abnormal  findings  for  the  tncep  and 
brachioradialis  muscle  stretch  reflexes  were  recorded  for  rater  pair  1  &  3  so  rehab,  ,.y 
could  no.  be  computed  for  these  variables.  Reliability  statistics  were  also  computed  for 
transformed  ratings  that  identified  an  abnormality  of  any  dermatome,  myotome  or  median 
sensory  field  of  the  involved  limb.  The  reliability  of  these  variables  was  fair  to  good  w„h 
Kappa  values  of  .51,  .64,  and  .48,  respectively.  Although  reliability  for  MMT  of  the 
abductor  pollicus  brevis  was  poor  when  assessed  in  subjects  with  a  variety  of  conditions, 
i,  demonstrated  fair  to  good  reliability  when  assessed  in  subjects  diagnosed  w„h  CT 
(Kappa=  .65).  However,  the  observed  base  rate  was  still  low  (valid  n-22,  observe 

rate  of  10%). 
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Table  16 


Reliability  of  clinical  examination  variables  with  95%  confidence 


intervals  (95CI) 


Variable 

question  1-“Most 
bothersome  symptoms..” 


Kappa  95CI 

.74  (.55  -  .93) 


ICC  95CI 


question  2-“Where  most 
bothersome..” 


.82  (.68  -  .96) 


question  3-“Symptom 
behavior..” 
question  4-“Hand 
fat/swollen..” 


.57  (.35  -  .79) 
.85  (.68-  1.0) 


question  5- 

“Fumbling/dropping..” 

.95  (.85 -1.0) 

— 

question  6-“Entire  limb 
numb..” 

.53  (.26 -.81) 

- 

question  7-“Symptoms 
keep  from  sleep..” 

.70  (.48  -  .92) 

— 

question  8-“Night 
symptoms  wake..” 

.83  (.60-  1.0) 

— 

question  9-“Neck 
movement  improves..” 

.67  (.44  -  .90) 

— 

question  10-“Hand  shaking 

.90  (.75-1.0) 

— 

improves..” 

question  11-“Worse  with 

.72  (.49  -  .95) 

hand  use..” 

C5  Dermatome 

.67  (.33  -  1.0) 

C6  Dermatome 

.28  (.00  -  .58) 

C7  Dermatome 

.40  (.06  -  .74) 

C8  Dermatome 

.16  (.00 -.50) 

T1  Dermatome 

.46  (.04  -  .88) 

MMT  deltoid 

.62  (.28  -  .96) 

MMT  biceps  brachii 

.69  (.36  -  1.0) 

MMT  extensor  carpi 

.63  (.26-1.0) 

radialis  longus/brevis 

MMT  triceps  brachii 

.29  (.00  -  .79) 

MMT  flexor  carpi  radialis 

.23  (.00  -  .69) 

MMT  abductor  pollicus 

.39  (.00  -  .80) 
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Table  16  (cont’d.). 


Variable 
MMT  first  dorsal 
interrosseus 
biceps  brachii  MSR 
median  sensory  field  1 
median  sensory  field  2 
median  sensory  field  3 
dermatome  (any  abnormal) 
MMT  (any  abnormal) 
Median  nerve  field  (any 


Kappa  95CI 

.37  (.00  -  .80) 

.73  (.38 -1.0) 
.48  (.23  -  .73) 
.50  (.25-75) 
.40  (.12 -.68) 
.51  (.26  -  .76) 
.64  (.35  -  .93) 

.48  (.22  -  .74) 


ICC  95C1 


abnomal) 

Spurling’s  A 

.60  (.32  -  .87) 

Spurling’s  B 

.62  (.25  -  .99) 

Shoulder  abduction 

.20  (.00  -  .59) 

Valsalva 

.69  (.36-  1.0) 

Distraction 

.88  (.64-  1.0) 

“ 

Tinel  A 

.47  (.21  -  .72) 

Tinel  B 

.35  (.10  -  .60) 

ULTT  A  (collapsed) 

.70  (.51  -  .89) 

ULTT  A  (dichotomized) 

.76  (.51  - 1.0) 

ULTT  B  (collapsed) 

.45  (.23  -  .67) 

ULTT  B  (dichotomized) 

.83  (.65-  1.0) 

CCT 

.68  (.59  -  .86) 

CCT  (dichotomized) 

.77  (.58  -  .96) 

Phalen’s  (collapsed) 

.44  (.26  -  .62) 

Phalen’s  (dichotomized) 

.79  (.59 -1.0) 

.79  (.65 

cervical  flexion 

cervical  extension 
Cervical  left  rotation 
Cervical  right  rotation 
Cervical  left  sidebending 
Cervical  right  sidebending 
Wrist  anterior-posterior 
Wrist  medial-lateral 


.75  (.59  -  .85) 
.63  (.22  -  .82) 
.63  (.40  -  .78) 
.68  (.62  -  .87) 
.77  (.62  -  .87) 
.86  (.75  -  .92) 


4.6  degrees 
4.8  degrees 

6.6  degrees 

7.3  degrees 

5.3  degrees 

5.4  degrees 
2.1  millimeters 
2.1  millimeters 
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Figure  12.  Distribution  of  cervical  flexion  measurements 


Frequency  Stem  &  Leaf 


2.00 

Extremes 

5.00 

3 

9.00 

4 

24.00 

5. 

24.00 

6 

15.00 

7 

1.00 

8 

(=<25 ) 

00558 

000055556 

000000000445555555555588 

000000000000445555555558 

000000000000055 

0 


Stem  width: 
Each  leaf: 


10.00 

1  case ( s ) 


Figure  13.  Distribution  of  cervical  extension  measurements 
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Figure  14.  Distribution  of  cervical  left  rotation  measurements 
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Figure  15.  Distribution  of  cervical  right  rotation  measurements 
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Figure  16.  Distribution  of  cerical  left  side-bending  measurements 
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Figure  17.  Distribution  of  cervical  right  side-bending  measurements 
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Figure  18.  Distribution  of  wrist  anterior-posterior  measurements 
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Figure  19.  Distribution  of  wrist  medial-lateral  measurements 


The  hypothesized  level  of  reliability  for  each  clinical  examination  variable  and  the  status 
of  the  hypothesis  is  listed  in  Figures  20  through  22.  All  judgments  of  hypotheses  for 
provocative  tests  were  based  on  tests  with  dichotomous  results.  In  summary,  the 
following  levels  of  reliability  according  to  the  criteria  of  Landis  and  Koch  or  Portney 

and  Watkins243pMJ  were  determined  for  the  clinical  examination  variables  in  this  study. 

Excellent  (K>.75  or  ICO.90):  Distraction,  ULTT  A  and  B  (dichotomized), 
CCT  (dichotomized),  Phalen’s  test  (dichotomized),  questions  2-“Where  most 
bothersome..”,  4-“Hand  fat/swollen  ”,  5--Tumbling/dropping..”,  Slight  symptoms 

wake  ”  and  10-“Hand  shaking  improves.” 

Fair  to  Good  (K=.40  -  .75)  or  Good  (ICO  .75  -  .90)1  Spurling's  A  and  B, 

Valsalva,  Tinel's  A,  ULTT  A  and  B  (collapsed),  CCT,  Phalen’s  (collapsed),  questions  1 
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“Most  bothersome  symptoms..”,  3-”Symptom  behavior..”,  6-“Entire  limb  numb..  ,  7- 
“Symptoms  keep  from  sleep..”,  9-”Neck  movement  unproves..”,  1  l-“Worse  with  hand 
use..”,  C5,  C7  and  T1  dermatomes,  MMT  of  the  deltoid,  biceps  brachii.  and  extensor 
carpi  radialis  longus/brevis,  bicep  MSR,  median  sensory  fields  1, 2,  and  3.  dermatome 
(any  abnormality),  myotome  (any  abnormality),  and  median  sensory  levels  (any 
abnormality),  cervical  flexion,  extension,  left  rotation,  and  the  anterior-posterior/medial 

lateral  measurements  of  the  wrist  ratio. 

Poor  (K<40)  or  Poor  to  Moderate  (ICCX75):  Shoulder  abduction.  Tinel  B.  C6 
and  C8  dermatomes,  MMT  of  the  triceps  brachii,  flexor  carpi  radialis.  abductor  pollicis 
brevis,  and  the  first  dorsal  interossei,  cervical  right  rotation  and  bilateral  sidebendmg. 


Questions  1-3,  6,  7 
and  9 


Neck  ROM 


1,  3,  6,  7,  and  9 

Rejected :  All  ROM 
parameters 


Wrist  Ratio 
measurements 


Tinel’s  A 
Tine’s  B 


Accepted:  questions 
2,  4,  5,  8,  andIO 
Rejected:  questions 
1,  3,  and  11 
Rejected:  both 
anterior-posterior 
and  medial-lateral 
measures 
Rejected 
Rejected 


Route  20  Hypothesis:  Clinical  examination  variables  will  demonstrate  an  Excellent  level 
oSbility  (Kappa  >  .75  or  Intraclass  Correlation  Coefficient  >.90) 
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Sensation 


MSR* 

MMT 


Spurling’s  A 
Spurling's  B 
Shoulder  Abduction 
Valsalva 
Neck  Distraction 


Accepted:  C5,  C7, 
and  T1  dermatome 
Rejected:  C6  and  C8 
dermatomes 
Accepted:  biceps 
brachii 

Accepted:  deltoid, 
biceps  brachii, 
extensor  carpi 
radialis, 

Rejected :  triceps 
brachii,  flexor  carpi 
radialis,  adbuctor 
pollicus  brevis* 

Accepted 

Accepted 

Rejected 

Accepted 

Rejected 


MMT  abductor 
pollicus  brevis* 
Phalen’s 


CCT 


Accepted:  Median 
nerve  sensory  fields 
1-3 

Accepted 

Rejected 


Rejected 


Figure  21.  Hypothesis:  eiinical  exammationvarjabies  waidemonstrateaFa  raters 


CR  Variables 

Hypothesis  status 

CTS  Variables 

Hypothesis  status 

ULTT  A 

Rejected 

ULTT  A 

Rejected 

ULTT  B 

Rejected 

ULTT  B 

Rejected 

Fioure  22.  Hypothesis:  Clinical  examination  variables  will  demonstrate  a  Poor  (K  <  .40)  or 
Poor  to  Moderate  (ICC  <.75)level  of  reliability 
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Based  on  the  95CI’s  for  each  variable,  the  level  of  reliability  was  considered  to  be 
definitive  for  the  following  variables: 

Excellent:  question  5 

Fair  to  Good  (Kappa)  or  Good  (ICC):  Distraction,  ULTTA  (3  levels),  UL 
&  B  (dichotomous).  CCT  (3  levels),  CCT  and  Phalen's  (dichotomous),  questions 
1  2  4  7,  and  8  -  1 1,  and  wrist  medial/lateral  measurements. 

No  clinical  examination  variable  had  a  poor  level  of  reliability  that  was  considered 

definitive. 


6.3  Hypothesis  #2  -  Diagnostic  Accuracy 

,  niaunnstic  Cliaracteristics  of  Single  ExamjnatipjLlterns_for_£B-Sn4£J^ 

The  first  rater’s  results  were  used  in  all  cases  to  determine  the  diagnostic  accuracy  of 
clinical  examination  variables.  To  calculate  diagnostic  accuracy  characteristics  for  ^ 
clinical  examination  variables,  the  following  procedure  was  employed.  For  CR,  t 
subjects  classified  as  CR  formed  the  disease  positive  group.  All  subjects  classified  as 
normal  or  as  CTS  served  as  the  control  group.  Likewise,  for  CTS,  the  3 1  subjects 
classified  as  CTS  formed  the  disease  positive  group  and  all  subjects  classified  as  norma 
or  CR  seived  as  the  control  group.  The  6  subjects  with  classifications  other  than  normal, 
CR,  or  CTS  were  excluded  from  the  analysis  and  the  remaining  75  subjects  were  used  for 
diagnostic  accuracy  calculations.  For  variables  that  had  no  false  negative  or  false 
positive  findings,  .5  was  added  to  each  cell  for  adjustment.-  The  prevalence  of  CR  and 
CTS  for  the  total  sample  of  81  subjects  was  16%  and  38%,  respectively.  The  diagnostic 
characteristics  for  predictor  variables  are  shown  by  diagnostic  category  in  Table  17  and 
,  8,  Values  that  met  criteria  for  acceptability  are  in  bold.  Questions  1  -  3  are  multi-level 
response  items  and  do  not  have  negative  responses.  Therefore,  diagnostic  characteristic 
are  assigned  to  each  level.  The  Likelihood  ratio  index  (LRi)  associated  with  eac  eve  is 
interpreted  as  a  positive  Likelihood  ratios  (LR+)  because  an  absent  or  “negative” 
response  for  one  level  is  the  positive  response  of  a  different  level.  Descnptive  statistics 
for  intial  NDI  scores  and  for  the  initial  FSS  and  SSS  scores  of  the  CR  and  CTS  groups, 
respectively,  are  listed  in  Table  19. 
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Table  17  Cervical  Radiculopathy:  Senjdti ^(lr+) ^ithTs^clfnfidence  intervals  (95CI) 
Ratios  (LR-),  and  Positive  Likelihood  Ratios  (LR+)  witn  so/. 


Variable 
question  1-' 
bothersome  Sx’s. 

i.  Pain 

ii.  Numb/tingling 

iii.  Loss  of  feeling 
question  2-“Where 
most  bothersome..” 

i.  Neck 

ii.  Shoulder/scap. 

iii.  Arm  AE 

iv.  Arm  BE 
v.  Hand  or  fingers 

question  3-“Sx. 
behavior..” 

i.  Constant 

ii.  Intermittent 

iii.  Variable 
question  6-“Entire 
limb  numb..” 
question  7-“Sx’s. 
keep  from  sleep.." 
question  9-“Neck 
move  improves..” 

C5  Dermatome 
C6  Dermatome 
C7  Dermatome 
C8  Dermatome 
T1  Dermatome 
MMT  deltoid 

MMT  biceps  brachii 
MMT  extensor  carpi 
radiaiis 

longus/brevis 


.46  (.19 -.73) 
.46  (.19  -  .73) 
.08  (.00  -  .92) 


15  (.00-  .35) 
,46  (.19-  .73) 


.38  (.12-  .48) 


Sp  95CI 


.53  (.41  -  .66) 
.55  (.42  -  .67) 
.92  (.85  -  .99) 


.92  (.85  -  .99) 
.84  (.74  -  .93) 
.93  (.87-  1.0) 
.84  (.74  -  .93) 
.48  (.35  -  .60) 


LR-  95CI 


LR+  95C1 


.99  (.52 -1.9) 
1.0  (.53-2.0) 
.95  (.12-7.5) 


1.9  (.41  -8.6) 
2.8  (1.2 -6.5) 


.73  (.35-  1.5) 


.08  (.00  -  .22) 

.56  (.06  -  .56) 

.88  (.35  -  .88) 

.84  (.75  -  .93) 

.63  (.51  -  .75) 

.53  (.41  -  .66) 

* 

.48  (.07  -  3.4) 
.83  (.34  -  2.0) 
1.3  (.79-2.2) 

.23  (.00  -  .46) 

.73  (.61  -  .84) 

1.1  (.76-1.5) 

.84  (.29  -  2.5) 

.62  (.35  -  .88) 

.90  (.82  -  .99) 

.43  (.21  -  .85) 

6.5  (2.3  -  18.0) 

.69  (.44  -  .94) 

.70  (.59  -  .82) 

.44  (.19-1.0) 

2.4  (1 .4-4.0) 

.15  (.00-  .35) 
.15  (.00  -.35) 

.89  (82  -  .97) 
.70  (.59  -  .82) 

.95  (.74  -  1 .2) 
1.2  (.90-1-6) 

1.4  (.32-5.8) 
.52  (.14-2.0) 

.23  (.00  -  .46) 
.15  (.00  -  .35) 

.08  (.00  -  .22) 


.74  (.63  -  .85) 
.82  (.72  -  .92) 
.82  (.72-  .91) 
.89  (.81  -  .97) 
.94  (.87-1.0) 


.87  (.64 -  1.2)  2.0  (.61  -6.9) 

.90  (.71  -1-2  2.4  (.49 -11.7) 


.90  (.83 -.98)  1.02  (.86 -1.2)  .79  (.10  6.1) 


108 


Table  17  (cont’d) 


MMT  triceps  brachii 

.08  (.00  -  .22) 

.94  (.87-  1.0) 

.99  (.83  -  1.2) 

1.2  (.14-9.8) 

MMT  flexor  carpi 

radialis 

*★ 

.90  (.83  -  .98) 

irk 

*-* 

MMT  abductor 

pollicus  brevis 

** 

.84  (.75  -  .93) 

★* 

** 

MMT  first  dorsal 

interosseus 

biceps  brachii  MSR 
Spurling’s  A 
Spurling’s  B 

.23  (.00  -  .46) 

.46  (.19-  .73) 

.46  (.19-  .73) 

.94  (.87-1.0) 

.95  (.90  -  1 .0) 

.87  (.78  -  .95) 

.75  (.65  -  .86) 

.81  (.60-1.2) 

.62  (.37  -  1.03) 

.71  (.42-1.20) 

4.8  (1.1  -21.0) 

3.5  (1.5 -8.4) 

1.9  (.90-3.9) 

Shoulder 

Abduction 

Valsalva 

Distraction 

.08  (-.07  -  .22) 

.31  (.06-  .56) 

.38  (.12-  .65) 

.92  (.85  -  .99) 

.94  (.87-1.0) 
.92  (.84  -  .99) 

1.0  (.84 -1.19) 

.74  (.51-1.1) 

.67  (.43  -  1 .0) 

.95  (.12-7.5) 

4.8  (1.4-16.7) 
4.5  (1.5-13.4) 

Upper  limb  tension 

test  A* 

.96  (.87-  1.0) 

.23  (.12-  .34) 

.15  (.01  -2.4) 

1.3  (1.1  -  1.5) 

Upper  limb  tension 

test  B 

.62  (.35  -  .88) 

.34  (.22  -  .46) 

1.1  (.52-2.5) 

.93  (.58  -  1.5) 

cervical  flexion 

(<37) 

.08  (-.07  -  .22) 

.94  (.87-1.0) 

.99  (.83-1.2) 

1.2  (.14-9.8) 

cervical  extension 

(<55) 

.38  (.13-  .65) 

.71  (.60  -  .82) 

.87  (.55  -  .1.4) 

1.3  (.60-2.9) 

cervical  left  rotation 

(<48) 

.11  (.00-  .32) 

.81  (.64  -  .98) 

1.1  (.80-1.5) 

.58  (.08  -  4.52) 

Cervical  left 

rotation  (<57) 

.89  (.68  -  1 .0) 

.57  (.36  -  .78) 

.19  (.03-1.4) 

2.07  (1.2 -3.6) 

visual  analog  scale 

worse  (>7.5) 

.45  (.16-  .75) 

.87  (.78  -  .96) 

.63  (.36  -  1 .08) 

3.6  (1.38-9.2) 
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.35  (.19  -.52) 
.58  (.41  -  .75) 
.06  (-.02-  .15) 


.45  (.31  -  .60) 
.64  (.49  -  .78) 
.91  (.82  -  .99) 


.65  (.38-1.1) 
1 .6  (.98  -  2.6) 
.71  (.14-3.6) 


Table  18  Carpal  Tunnel  Syndrome:  Sensitivity  (Sn),  Specificity  (Sp),  Negative  Likelihood 
Ra«os  (  W  and  Positive  LiKeHhood  Ratios  (LR+)  with  95%  confidence  intervals  (95CI) 

Variable  I  Sn  95CI  Sp  95CI  LR-  95CI  UR*  95CI 

question  1-“Most 
bothersome  Sx’s. 

i.  Pain 

ii.  Numb/tingling  | 

iii.  Loss  of  feeling 
question  2-“Where 
most  bothersome..” 

i.  Neck 

ii.  Shoulder/scap. 

iii.  Arm  AE 

iv.  Arm  BE 

v.  Hand  or  fingers 

question  3-“Sx. 
behavior.." 

i.  Constant 

ii.  Intermittent 

iii.  Variable 
question  4-“Hand 
fat/swollen..” 
question  5-“Fumble 
and  dropping..” 
question  6-“Entire 
limb  numb..” 
question  8-“Night 
sx’s.  wake..” 
question  10-“Hand 
shaking  improves.” 
question  11 -“Worse 
with  hand  use..” 

Median  sensory 
field  1 

Median  sensory 
field  2 


.06  (-.02 -.15) 

.16  (.03 -.29) 

.06  (.00-  .15) 

.10  (.00 -.20) 

.61  (.44  -  .78) 

.88  (.79  -  .98) 

.74  (.61  -  .87) 

.95  (.89-  1.0) 

.84  (.73  -  .95) 

.58  (.43  -  .73) 

* 

.55  (.12-2.7) 

.63  (.24-1.6) 
1.39  (.21-9.3) 
.59  (.17-2.1) 

1.5  (.93-2.3) 

.23  (.08  -  .37) 

.42  (.25  -  .59) 

.35  (.19-  .52) 

.91  (.82  -  .99) 

.68  (.54  -  .82) 

.41  (.26  -  .55) 

* 

2.5  (.79-7.8) 
1.3  (.72-2.4) 
.60  (.35-1.0) 

.48  (.31  -  .66) 

.66  (.52  -  .80) 

.78  (.52-1.2) 

1.42  (.82-2.5) 

.74  (.59  -  .90) 

.61  (.47  -  .76) 

.42  (.22  -  .80) 

1.9  (1.3- 1.9) 

.35  (.19-  .52) 

.80  (.68  -  .91) 

.81  (.60-1.1) 

1.7  (.82-3.7) 

.56  (.41  -  .71) 

.32  (.18-  .46) 

1 .4  (.79  -  2.4) 

.82  (.59-1.2) 

.77  (.63  -  .92) 

.60  (.46  -  .75) 

.37  (.19 -.75) 

2.0  (1.3 -3.0) 

.74  (.59  -  .90) 

.39  (.24  -  .53) 

.67  (.33-  1.4) 

1.2  (.88-1.7) 

.55  (.37  -  .72) 

.68  (.54  -  .82) 

.66  (.53  -  1 .0) 

1.7  (1.0 -3.0) 

I  .57  (.29  -  .65) 

.65  (.59  -  .78) 

.84  (.56-  1.3) 

1.3  (.74-2.2) 
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Table  18  (cont’d.) 


Median  sensory 

field  3 

.43  (.26 -.61) 

.75  (.62  -  .88) 

.76  (.53-1.2) 

1.7  (.90 -3.3) 

MMT  abductor 

pollicus  brevis 

.19  (  .95  -  .33) 

.91  (.82  -  .99) 

.89  (.73 -1.1) 

2.1  (.66-6.9) 

Tinel’s  A 

.42  (.25  -  .59) 

.56  (.41  -  .71) 

1.0  (.70-1.6) 

.95  (.56 -1.6) 

Tinel’s  B 

.48  (.66 -.31) 

.66  (.52  -  .80) 

.78  (.52 -1.2) 

1.4  (.82 -2.5) 

upper  limb  tension 

test  A 

.71  (.55-  .88) 

.12  (.02-  .22) 

2.4  (.87  -  6.59) 

.81  (.63-1.1) 

upper  limb  tension 

test  B 

.59  (.41  -  .77) 

.20  (.17  -  .44  ) 

1.4  (.73-2.6) 

.84  (.58-1.2) 

carpal  compression 

test 

.74  (.59  -  .90) 

.47  (.32  -  .61) 

.55  (.28 -1.1) 

1.4  (.98 -2.0) 

Phalen’s  test 

.70  (.54-  .86) 

.37  (.23  -  .52) 

.81  (.41  -1.6) 

1.1  (.80-1.6) 

wrist  ratio  (>.68) 

.71  (.55  -  .87) 

.63  (.47  -  .78) 

.68  (.41  -1.1) 

1.5  (.92-2.6) 

wrist  ratio  (>.70) 

.58  (.39  -  .77) 

.63  (.47  -  .78) 

.68  (.41  -1.1) 

1.5  (.92 -2.6) 

Hand  diagram 

.68  (.51  -  .84) 

.42  (.27  -  .57) 

.77  (.41  -1.4) 

1.2  (.82 -1.7) 

Symptom  Severity 
Scale  (>3.2) 

.35  (.19-  .52) 

.86  (.76  -  .96) 

.75  (.56-1.0) 

2.5  (1.1  -6.1) 

Functional  Status 

Scale  (>2.5) 

.42  (25  -  .59) 

.86  (.76  -  .97) 

.67  (.49  -  .93) 

3.1  (1.3 -7.2) 

Table  19  Descriptive  Statistics  for  initial  patient  self-report  measures: 


Measure 

Number  of 
Subjects 

Mean 

1 

/%TO 

Minimum 

Maximum 

HR  CTS 

Std.  Deviation 

CR  CTS 

NDI 
SSS 
FSS 
Valid  N 

CR  CTS 

13  28 

13  31 

13  31 

13  28 

CR 

25% 

2.3 

1.7 

CTS 

21% 

2.8 

2.2 

v>rv 

2% 

1.0 

1.0 

.00 

1.0 

1.0 

64% 

3.6 

3.9 

20% 

4.1 

3.5 

19.9 

.9 

1.0 

14.6 

.8 

.8 

NDI-  Neck  Disability  Index,  SSS-  Symptom  Severity  Scale,  FSS-  Functional  Status  Scale 
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All  diagnostic  accuracy  characteristics  of  provocative  tests  were  computed  using 
dichotomous  ratings  only.  Receiver  operating  characteristic  curves  were  generated  for 
continuous  variables  to  establish  optimum  cut-off  points  and  are  listed  in  Figures  23 
through  28.  Receiver  operating  curves  that  revealed  no  potentially  useful  cut-off  points 
for  the  following  CR  variables  are  not  shown:  NDI,  cervical  right  rotation,  bilateral 
sidebending,  and  the  VASnow  pain  rating.  The  area  under  the  curves  for  each  of  these 
variables  was  less  than  .54.  Receiver  operating  curves  revealed  no  potentially  useftil  cut¬ 
off  values  for  the  following  CTS  variables  and  are  not  shown:  VASnow  and  VASworse 
pain  ratings.  The  area  under  the  curves  for  each  of  these  variables  was  less  than  .42. 
Because  the  proposed  best  cut-off  point  for  the  wrist  ratio  is.70,174  an  ROC  curve  analysis 
was  used  to  determine  whether  this  was  the  best  cut-off  value  in  this  sample  of  patients 
(Figure  29.  The  best  wrist  ratio  cut-off  point  for  this  sample  of  patients  was  .68  with  the 

area  under  the  curve—  .58. 


1  -  Specificity 


Figure  23.  Receiver  operating  curve 


for  cervical  flexion:  Cut-off  37  degrees 
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1  -  Specificity 


Figure  24.  Receiver  operating  curve  for  cervical  Extension:  Cut-off  55  degrees 


1  -  Specificity 


Figure  25.  Receiver  operating  curve  for  cervical  left  rotation:  Cut-offs  <48  and  <57  degrees 
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1  -  Specificity 


Figure  26.  Receiver  operating  curve  for  visual  analog  “worst”  pain  rating  (cervical 
radiculopathy):  Cut-off  >7.5 


1  -  Specificity 


Figure  27.  Receiver  operating  curve  for  Symptom  Severity  Scale:  cut-off  >3.2 


1  -  Specificity 


Figure  28.  Receiver  operating  curve  for  Functional  Status  Scale:  cut-off  >2.5. 


1  -  Specificrty 


Figure  29.  Receiver  operating  curve  for  wrist  ratio:  cut-off  >.68. 
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The  hypothesized  levels  of  diagnostic  accuracy  for  each  clinical  examination  variable  and 
the  status  of  the  hypotheses  are  listed  in  Figures  30  -  33. 


CR  Variables 

Hypothesis  status 

question  2-“Where  most  bothersome..” 

Accepted:  all  levels  except  v. 

question  7-“Symptoms  keep  from  sleep..” 

Accepted 

Sensation 

Accepted:  all  dermatome  levels 

Muscle  stretch  reflexes 

Accepted:  biceps  brachii, 

Undeterminable:  brachiradialis, 

triceps  brachii 

MMT 

Accepted:  all  muscles 

Distraction 

Accepted 

Spuling’s  A 

Accepted 

Spurling’s  B 

Accepted 

Valsalva 

Accepted 

shoulder  abduction 

Accepted 

Figure  30  Hypothesis:  Cervical  radiculopathy  clinical  examination  variables  will 
demonstrate  an  acceptable  level  of  diagnostic  accuracy  (Sn  or  Sp  >  .70  or  LR+  >  2.0  or  LR- 
<  .50) 

~  CR  Variables  ~ 

question  1-“Most  bothersome 
symptoms..” 

question  3-“Symptom  behavior.." 

question  6-“Entire  limb  numb..” 
question  9-“Neck  movement  improves” 
upper  limb  tension  test  A 
upper  limb  tension  test  B 

cervical  range-of-motion 

Visual  analog  scale 
Neck  Diability  Index 

Figure  31.  Hypothesis:  Cervical  radiculopathy  clinical  examination  variables  will  not 
demonstrate  an  acceptable  level  of  diagnostic  accuracy 


Hypothesis  status 

Accepted:  levels  i  and  ii 
Rejected:  level  iii 
Accepted:  level  ii 
Rejected:  levels  i  and  iii 

Rejected 

Rejected 

Rejected 

Accepted 

Accepted:  right  rotation  and  bilateral 
sidebending 

Rejected:  flexion,  extension,  left  rotation 
Rejected 

Accepted 
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Variables 

Hypothesis  status 

question  2-“Where  most 

Accepted :  levels  i,  ii,  iii,  and  iv 

bothersome..” 

Rejected:  level  v 

question  4-“Hand  fat/swollen..: 

Rejected 

question  5-“Fumbling/dropping..” 

Accepted 

question  8-“Night  symptoms 

wake..” 

Rejected 

question  10-“Hand  shaking 
improves.." 

Accepted 

question  11 -“Worse  with  hand 

use..” 

Accepted 

Phalen’s  test 

Accepted 

carpal  compression  test 

Accepted 

Tinel’s  A 

Rejected 

Tinel’s  B 

Rejected 

wrist  ratio 

Rejected 

hand  diagram 

Rejected 

Figure  32  Hypothesis:  Carpal  tunnel  syndrome  clinical  examination  variables  will 
demonstrate  an  acceptable  level  of  diagnostic  accuracy  (Sn  or  Sp  >  .70  or  LR+  >  2.0  or  LR 
<  .50). 
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Variables 

Hypothesis  status 

question  1-“Most  bothersome 

Accepted:  levels  i,  ii 

symptoms..” 

Rejected:  level  iii 

question  3-“Symptom  behavior..” 

Accepted:  levels  ii  and  iii 

Rejected :  level  i 

question  6-"Entire  limb  numb..” 

Rejected 

Manual  muscle  test  (abductor  pollicus 

brevis) 

Rejected 

Sensation 

Accepted:  median  nerve  fieldl,  median 
nerve  field2 

Rejected:,  median  nerve  field3 

Visual  analog  scale 

Accepted 

Symptom  Severity  Scale 

Rejected 

Functional  Status  Scale 

Rejected 

Figure  33  Hypothesis:  Carpal  tunnel  syndrome  clinical  examination  variables  will  not 
demonstrate  an  acceptable  level  of  concurrent  validity 


Based  on  the  lower  bound  of  the  95CI,  definitive  findings  for  the  Sp  and  Sn  are  listed  in 
Figure  34.  Variables  that  did  not  have  any  true  positive  or  true  negative  results  were 
excluded.  Although  definitive  results  were  found  to  occur,  only  1 8  variables  had 
adequate  power  and  all  pertained  to  Sp  values  only.  Sixty-two  subjects  classified  as 
positive  for  the  disease  of  interest  are  required  to  achieve  a  95CI  for  a  Sn  of  .80  with  a 
lower  limit  of  .70.  The  same  number  of  non-diseased  subjects  is  required  to  achieve  a 
95CI  for  a  Sp  of  .80  with  a  lower  limit  of  .70.  With  the  exception  of  question  7 
(LR+=6.5.  950=  2.31  -  18.0),  decisions  regarding  definitively  unacceptable  LR’s  were 
not  made  due  a  lack  of  power  for  achieving  the  previously  specified  upper  and  lower 


950  limits. 
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Cervical  radiculopathyVariables 

Sensitivity  Specificity- 

question  7  question  1  (iii) 

question  2(i  -  iv) 
questions  7 
upper  limb  tension  Spurling's  A 
test  A 

question  3(iii)  shoulder  Abduction 

Valsalva 


Carpal  tunnel  syndrome  Variables 

Sensitivity  Specificity 

question  1  (iii) 
question  2(i  &  iii) 
question  3(i) 
symptom  Severity  Scale 

Functional  Status  Scale 
abductor  pollicus  brevis 
muscle  test 


distraction 

upper  limb  tension  test  A 
cervical  flexion 


visual  analog  scale 
(worse) 

C5  dermatome  level 
biceps  brachii  muscle 
stretch  reflex 
all  muscle  tests 


Figure  34.  Definitively  acceptable  sensitivity  and  specificity  findings  of  clinical 
examination  variables 
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Cervical  radiculopathyVariables 

Carpal  tunnel  syndrome  Variables 

Sensitivity 

Specificity 

Sensitivity 

Specificity 

question  2(i  and  v) 

Question  1(i  and  ii) 

question  1(i  and  iii) 

question  3(iii) 

question  3(i  and  ii) 

Question  2(v) 

question  2(i,  ii,  iii,  and  iv) 

question  8 

question  6 

Question  3(iii) 

question  3(i  -  iii) 
questions  4 
question  6 

question  1 1 

C5  and  C6 

upper  limb  tension 

Median  sensory  field  2 

Phalen’s  test 

dermatomes 

test  A  and  B 

Median  sensory  field  3 

carpal  compression 
test  upper  limb 
tension  test  A  and  B 

biceps  brachii 
muscle  stretch  reflex 

abductor  pollicus  brevis 
muscle  test 

hand  diagram 

All  muscle  tests 
Valsalva 

shoulder  abduction 
distraction 

cervical  flexion, 
extension,  left 
rotation 

Tinel's  A  and  B 

Functional  Status  Scale 
Symptom  Severity  Scale 

wrist  ratio 

Figure  35.  Definitively  unacceptable  sensitivity  and  specificity  findings  of  clinical 
examination  variables 


Figure  36  summarizes  clinical  examination  variables  with  acceptable  Likelihood  ratios 
for  each  by  each  respective  condition. 
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Acceptable  Likelihood  ratios  (LR+  >  2.0  or  LR-  <  .50) 


CR  Variables 

CTS  Variables 

LR- 

LR+/LRi 

LR- 

LR+/LRi 

Questions: 

Questions: 

2(ii) 

2.8 

3(i) 

2.5 

7  .43 

6.5 

5  .42 

9  .37 

2.4 

10  .37 

2.0 

Neurological  Examination 

Neurological  Examination 

deltoid  muscle 

abductor  pollicus 

muscle  test 

2.0 

brevis  muscle  test 

2.1 

biceps  brachii 

Muscle  test 

2.4 

biceps  muscle 

Stretch  reflex 

4.8 

Self-report 

Self-report 

visual  analog 

SSS  (>3.2) 

2.5 

scale(  worse) 

3.6 

FSS  (>2.5) 

3.1 

Provocative  Tests 

ULTTA  .15 

Valsalva 

4.8 

distraction 

4.5 

Spurling’s  A 

3.5 

Scaled  measurements 
Involved  cervical 

Rotation  (left  only)  .19  2.1 


Figure  36.  Summary  of  acceptable  Likelihood  ratios 
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6.3.2  Diagnostic  Characteristics  of  Single  Examination  Items  for  CTS  Subclassified 
Groups 

To  assess  the  impact  of  spectrum  bias  on  test  sensitivity,  two  groups  were  formed  from 
the  CTS  diagnosed  subjects.  Group  A  consisted  of  subjects  subclassified  as 
mild/moderate  and  group  B  consisted  of  subjects  subclassified  as  pronounced/severe. 
Subjects  classified  as  normal  or  CR  comprised  the  control  group;  groups  A  and  B  were 
excluded  from  each  other’s  respective  control  group.  Descriptive  statistics  for  the  initial 
SSS,  FSS  and  VASnow  scores  of  the  two  groups  are  listed  in  Table  20.  Because  the 
control  group  remained  unchanged,  only  Sn  values  were  affected.  Therefore,  only  Sn  and 
Likelihood  ratio  values  for  the  two  groups  are  reported  in  Tables  21  -  22.  Calculations 
were  performed  as  previously  described  and  for  variables  that  had  no  false  negative  or 
false  positive  findings,  .5  was  added  to  each  cell  for  adjustment.271  The  cut-off  values 
previously  determined  for  all  CTS  group  were  used  and  no  ROC  curves  or  separate  cut¬ 
off  values  were  established  for  the  two  groups. 

Numerical  differences  in  diagnostic  characteristics  between  the  groups  that  may  indicate 
a  trend  are  in  bold  type  if  the  value  was  considered  acceptable  and  exceeded  the 
following  difference  thresholds:  Sn=  >  .10,  LR-=  >  .15,  and  LR+=  >1.5.  Power  was  not 
satisfactory  to  detect  a  difference  for  the  previously  specified  levels  of  acceptance. 
Question  5(v)  for  the  pronounce/severe  group  was  the  only  variable  with  a  significantly 
different  Sn  value  as  determined  by  the  95CI  interval  and  is  italicized  in  both  tables. 


Table  20.  Descriptive  statistics  of  self-report  instruments  for  subjects  by  group  severity 


Measure 

Group 

N 

Mean 

Std. 

Deviation 

SSS1 

Mild/moderate 

21 

2.82 

.86 

Pronounced/Severe 

10 

2.80 

.42 

FSS1 

Mild/moderate 

21 

2.35 

.85 

Pronounced/Severe 

10 

1.99 

.72 

VASNOW1 

Mild/moderate 

19 

2.8 

2.5 

Pronounced/Severe 

8 

2.4 

2.4 
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Table  21.  Mild/Moderate  Carpal  Tunnel  Syndrome:  Sensitivity  (Sn),  Negative  Likelihood 
Ratio  (LR-),  and  Positive  Likelihood  Ratio  (LR+)  of  clinical  examination  variables  with  95% 
confidence  intervals  (95CI). 


Variable 

Sn  95CI 

LR-  95CI 

LR+  95CI 

question  1-“Most 

bothersome  Sx’s. 

i.  Pain 

.43  (22  -  .64) 

★ 

.79  (.45-  1.4) 

ii.  Numb/tingling 

.52  (.31  -  .74) 

1.4  (.82-2.5) 

iii.  Loss  of  feeling 

question  2-“Where  most 

bothersome..” 

.05  (.00-  .14) 

.52  (.06-4.4) 

i.  Neck 

10  (.00-  .22) 

.82  (.17 -3.9 

ii.  Shoulder/scap. 

.19  (.02-  .36) 

* 

.74  (.27-2.1) 

iii.  Arm  AE 

.10  (.00-  .22) 

2.1  (.31-13.5) 

iv.  Arm  BE 

.14  (.00-  .29) 

.88  (.25  -3.1) 

v.  Hand  or  fingers 

question  3-“Sx.  behavior.." 

.48  (.26  -.69) 

1.1  (.64-2.0) 

i.  Constant 

.14  (.00-  .29) 

1.6  (.39-6.4) 

ii.  Intermittent 

.48  (.26  -  .69) 

* 

1.6  (.80-2.8) 

iii.  Variable 

.38  (.17-  .59) 

.64  (.35-  1.2) 

question  4-“Hand 

fat/swollen..” 

.48  (.26  -  .69) 

.79  (.50-1.3) 

1.4  (.76-2.6) 

question  5-“Fumble  and 

dropping..” 

.67  (.47  -  .87) 

.54  (.28 -1.0) 

1.7  (1.1  -2.7) 

question  6-“Entire  limb 

numb..” 

.38  (.17-  .59) 

.78  (.54-1.1) 

1.9  (.84-4.1) 

question  8-“Night  sx’s. 

wake..” 

.76  (.58  -  .76) 

.75  (.31  -1.8) 

1.1  (.82-1.5) 

question  10-“Hand  shaking 

improves.” 

.76  (.58  -  .76) 

.39  (.18-  .88) 

1.9  (1.2 -3.0) 

question  11 -“Worse  with 

hand  use..” 

.76  (.58  -  .76) 

.62  (.26-1.4) 

1 .2  (.89  -  1 .7) 

Median  sensory  field  1 

.43  (.22  -  .64) 

.84  (.55 -1.3) 

1.4  (.70-2.6) 

Median  sensory  field  2 

.40  (.19-  .61) 

.94  (.62-1.4) 

1.1  (.57-2.1) 

Median  sensory  field  3 

.40  (.19-  .61) 

.80  (.54-1.2) 

1.6  (.76-3.4) 

MMT  abductor  pollicus 

brevis 

.19  (.02-  .36) 

.89  (.71  -1.1) 

2.1  (.58-7.6) 
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Table  21  (cont’d.) 


Tinel’s  A 

.38  (.17-  .59) 

1.1  (.72-1.7) 

.86  (.45  -  1 .6) 

Tinel’s  B 

.48  (.26  -  .69) 

.79  (.50-1.3) 

1.4  (.76-2.6) 

upper  limb  tension  test  A 

.78  (.59  -  .97) 

1.9  (.57-6.2) 

.88  (.67-1.2) 

upper  limb  tension  test  B 

.58  (.36  -  .80) 

1.4  (.69-2.8) 

.83  (.54-1.3) 

carpal  compression  test 

.71  (.52-  .91) 

.61  (2.9 -1.3) 

1.3  (.91-12.0) 

Phalen’s  test 

.70  (.50  -  .90) 

.81  (.37-1.8) 

1.1  (.77-1.6) 

wrist  ratio  (>.68) 

.76  (.68  -  .94) 

.62  (.26-1.4) 

1.2  (.89-1.7) 

wrist  ratio  (>.70) 

.62  (.41  -  .83) 

.67  (.37-1.2) 

1.4  (.89-2.3) 

Hand  diagram 

.67  (.47  -  .87) 

.80  (.40-  1.6) 

1.2  (.77-1.7) 

Symptom  Severity  Scale  (>3.2) 

.43  (.22  -  .64) 

.66  (.45  -  .98) 

3.1  (1.3 -7.5) 

Functional  Status  Scale  (>2.5) 

.48  (.26  -  .69) 

.61  (.40  -  .93) 

3.5  (  1.5 -8.3) 

Table  22  Prounounced/severe  Carpal  Tunnel  Syndrome:  Sensitivity  (Sn),  Negative 
Likelihood  Ratio  (LR-),  and  Positive  Likelihood  Ratio  (LR+)  of  clinical  examination 
variables  with  95%  confidence  intervals  (95CI) 


Variable 

Sn  95CI  LR-  95CI 

LR+  95CI 

question  1-“Most  bothersome 

Sx’s. 

i.  Pain 

.20  (-.06  -  .45) 

.37  (.10-1.3) 

ii.  Numb/tingling 

.70  (.42  -.98) 

1.9  (1.1  -3.4) 

iii.  Loss  of  feeling 

.10  (-.09-  .29) 

1.1  (.14-8.8) 

question  2-“Where  most 

bothersome..” 

i.  Neck 

*-*r 

ii.  Shoulder/scap. 

.40  (-.09  -  .29) 

.39  (.06  -  2.7) 

iii.  Arm  AE 

** 

iv.  Arm  BE 

** 

** 

v.  Hand  or  fingers 

.90  (.71 -1.0) 

2.15(1.4-3.2) 

question  3-“Sx.  behavior..” 

i.  Constant 

.40  (.10-  .70) 

4.4  (1.3-14.7) 

ii.  Intermittent 

.30  (.02  -  .58) 

.94  (.33  -2.7) 

iii.  Variable 

.30  (.02  -  .58) 

.51  (.19-1.4) 
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Table  22  (cont’d.) 


question  4-“Hand 

fat/swollen..” 

.50  (.19 -.81) 

.76  (.39  -1.5) 

1.5  (.70-3.1) 

question  5-“Fumble  and 

dropping..” 

.90  (.71-1.0) 

,16  (.03-1.1) 

2.3  (1.5 -3.6) 

question  6-“Entire  limb 

numb..” 

.30  (.02  -  .58) 

.88  (.57-1.4) 

1.5  (.48-4.5) 

question  8-“Night  sx’s.  wake..” 

.70  (.42  -  .98) 

.94  (.33  -  2.7) 

1.0  (.65-1.6) 

question  10-“Hand  shaking 

improves." 

.80  (.55-1.1) 

.33  (.09-1.2) 

2.0  (1.3 -3.3) 

question  11 -“Worse  with  hand 

use..” 

.70  (.42  -  .98) 

.78  (.28  -  2.2) 

1.1  (.71  -  1.8) 

Median  sensory  field  1 

.80  (.55-1.0) 

.29  (.08-1.0). 

2.5  (1.5 -4.3) 

Median  sensory  field  2 

.60  (.30  -  .90) 

.63  (.28-1.4) 

1.7  (.87-3.1) 

Median  sensory  field  3 

.50  (.19 -.81) 

.67  (.35-1.3) 

2.0  (.90-4.5) 

MMT  abductor  pollicus  brevis 

.20  (-.05  -  .45) 

.88  (.64  -  .88) 

2.2  (.47-10.4) 

Tinel’s  A 

.50  (.19-  .81) 

.90  (.46-1.8) 

1.1  (.56-2.3) 

Tinel’s  B 

.50  (.19 -.81) 

.76  (.39-1.5) 

1.5  (.70-3.1) 

upper  limb  tension  test  A 

.60  (.30  -  .0) 

3.4  (1.1  -10.3) 

.68  (.41  -  1.1) 

upper  limb  tension  test  B 

.60  (.10 -.90) 

1.3  (.55-3.2) 

.86  (.50-1.5) 

carpal  compression  test 

.80  (.55  -  1 .0) 

.73  (.12-1.6) 

1.5  (.99-2.3) 

Phalen’s  test 

.70  (.42  -  .98) 

.81  (.29  -  2.2) 

1.1  (.70-1.8) 

wrist  ratio  (>.68) 

.60  (.10 -.90) 

1 .0  (.44  -  2.4) 

.98  (.56-  .17) 

wrist  ratio  (>.70) 

.60  (.10-  .90) 

.70  (.32  -  1 .6) 

1.4  (.76-2.6) 

Hand  diagram 

.70  (.42  -  .98) 

.72  (.26  -  2.0) 

1.9(75-1.9) 

Symptom  Severity  Scale  (>3.2) 

.20  (-.05  -  .45) 

.93  (.67-1.3) 

1.4  (.34 -6.1) 

Functional  Status  Scale  (>2.5) 

.30  (.02  -  .58) 

.81  (.53-1.2) 

2.2  (.66  -  7.3) 

6.4  Hypothesis  #3  -  Predictive  Validity 

6.4. 1  Predictive  Validity  of  Single  Examination  Items  for  CR  Subjects: 

The  predictive  validity  of  clinical  examination  variables  for  CR  could  not  be 
meaningfully  measured  due  to  low  prevalence.  Ten  out  of  13  (77%)  subjects  returned 
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follow-up  forms  (mean  follow-up  days  =  42.8,  sd=  4.5).  The  surgical  status  was 
reported  for  nine  of  these  subjects;  none  received  surgery.  Seven  subjects  responded 
when  asked  whether  they  had  been  offered  surgery  for  their  condition.  Two  of  these 
subjects  had  been  offered  surgery.  Global  rating  of  change  scores  were  available  for  ten 
out  of  13  subjects.  Based  on  a  GRCS  criteria  of  >  +/-  4points,  four  of  these  subjects 
improved  and  two  were  worse;  one  of  the  two  subjects  who  had  improved  had  been 
offered  surgery.  The  descriptive  statistics  for  NDI  scores  of  subjects  who  improved  are 
compared  with  the  unimproved  group  and  are  listed  in  Table  23.  No  further  analyses  of 
these  subjects  were  performed. 


Table  23.  Descriptive  statistics  of  Neck  Disablity  Scores  at  follow-up. 


Group 

N 

Mean:  F/U  and  Chng 

Standard 

Deviation: 

F/U  and  Chng 

Minimum: 

F/U  and  Chng 

Maximum: 

F/U  and  Chng 

Improved 

4 

6.0/12.5 

5.817.3 

0/-4 

14/36 

Unimproved 

6 

31.8/-4.5 

21.0/7.0 

11/-14 

66/4.0 

F/U=  Follow-up 
Chnge=  Change 


6.4.2  Predictive  Validity  of  Single  Examination  Items  for  CTS  Subjects. 

To  measure  the  predictive  validity  of  clinical  examination  variables  for  CTS  subjects, 
diagnostic  accuracy  characteristics  were  calculated  using  the  same  method  described  for 
hypothesis  #2,  except  selected  measures  of  outcome  were  used  as  the  gold  standard.  The 
results  from  the  first  rater  were  used  in  all  cases  to  determine  the  predictive  validity  of 
clinical  examination  variables.  Outcome  variables  included  surgical  status  and  patient 
perception  of  change  based  on  a  GRCS.  A  change  score  of  >  4  points  on  the  GRCS  was 
used  as  the  cut-off  used  to  classify  a  patient  as  improved  and  a  change  score  of  <  4  was 
used  to  classify  a  patient  as  worsened.  Patients  who  did  not  meet  the  directional  change 
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criteria  served  as  the  control  group.  In  addition  to  the  clinical  examination  and  self- 
report  variables,  the  following  measures  were  used  included  as  predictor  variables. 
FABQA,  FABQB,  median  palmar  sensory  and  motor  latencies,  median  palmar  sensory 
and  motor  amplitudes,  EMG  spontaneous  activity  rating  of  the  abductor  pollicus  brevis 
muscle,  and  duration  of  symptoms. 

Twenty-five  of  the  3 1  subjects  (81%)  classified  with  CTS  returned  follow-up  forms  and 
indicated  whether  they  had  received  surgery  (mean  follow-up  days  =  59.0,  sd=  6.9).  Five 
subjects  from  two  centers  received  surgery  for  their  condition,  one  from  the  University  of 
Pittsburgh  Medical  Center  and  four  from  WHMC.  Three  of  these  subjects  had  improved 
and  one  subject  had  worsened,  and  one  was  unchanged.  An  analysis  of  change  for  the  20 
CTS  subjects  who  had  not  received  surgery  using  the  criteria  of  >  +/-  4  points  resulted  in 
three  subjects  who  improved  and  four  who  worsened.  Due  to  the  low  prevalence  of 
change  in  a  single  direction  (improved  or  worsened),  a  revised  criteria  of  >  +/-  3  points 
was  used.  Using  the  revised  change  criteria,  five  (25%)  subjects  improved  and  seven 
(35%)  subjects  worsened.  Fourteen  of  the  CTS  subjects  who  had  not  received  surgery 
responded  when  asked  whether  they  had  been  offered  surgery.  Six  of  these  14  subjects 
had  been  offered  surgery;  two  had  improved  and  two  had  worsened.  The  descriptive 
statistics  for  SSS  and  FSS  scores  of  subjects  who  changed  are  compared  in  Tables  24  and 

25. 

Table24  Descriptive  statistics  of  self-report  scores  for  improved  and  control  subjects 


Self-report 

Instrument 

N 

Group 

Mean: 

F-U/Chng 

Standard 

Deviation: 

F-U/Chng 

Minimum: 

F-U/Chng 

Maximum: 

F-U/Chng 

Symptom 

5 

Improved 

2J/.58 

.771.66 

1.6/-. 28 

3.6/1 .5 

Severity  Scale 

20 

Unimproved 

2.5/.04 

.871.47 

1.0/-1.4 

3.7/.64 

Funtional 

20 

Improved 

2.4/.S3 

.60/. 44 

1.75/.00 

3.0/1. 1 

Status  Scale 

20 

Unimproved 

2.1/-. 17 

.80/. 62 

1.0/-1.5 

3.3/.70 

F-U=  Follow-up 
Chnge=  Change 
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Tables  Descriptive  statistics  of  self-report  scores  for  worsened  and  control  subjects 


Self-report 

Instrument 

N 

Group 

Mean: 

F-U/Chng 

Standard 

Deviation: 

F-U/Chng 

Symptom 

7 

Worsened 

3.0/.  18 

,52/.36 

Severity  Scale 

13 

Not  Worsened 

2.4/.  17 

,89/.66 

Funtional 

7 

Worsened 

2.7/-.  18 

,53/.73 

Status  Scale 

13 

Not  Worsened 

1 .8/11 

.70/.62 

Minimum: 

F-U/Chng 

2.3/-.37~ 
1.0/-1.4 
2.0/-1 .5 
1.0/-. 14 


Maximum: 

F-U/Chng 

3.6/.60" 
3.7/1 .5 
3.3/.70 
3.0/1. 1 


F-U=  Follow-up 
Chnge=  Change 


6  4.2.1  Surgical  Intervention  Gold  Standard. 

The  five  subjects  who  received  surgery  were  considered  positive  for  the  condition  and  the 
remaining  20  subjects  served  as  the  control  group.  The  prevalence  of  surgical 
intervention  for  the  25  subjects  whose  return  forms  were  available  was  20%. 

All  predictive  validity  characteristics  of  provocative  tests  were  computed  using 
dichotomous  ratings  only.  ROC  curves  were  generated  for  all  continuous  vanables  to 
identify  the  optimum  cut-off  points  and  are  listed  by  diagnosis  in  Figures  37  -  43 . 

Receiver  operating  curves  revealed  no  potentially  useful  cut-off  values  for  the  following 
variables  and  are  not  shown:  VASnow,  VASworse,  FABQ  A,  FABQ  total,  and  median 
sensory  amplitude.  The  area  under  the  curve  for  each  of  these  vanables  was  less  than 
.56.  The  predictive  validity  characteristics  for  the  predictor  variables  are  shown  in  Table 
26.  Values  that  met  levels  of  acceptability  are  in  bold.  Items  with  no  true  positive  or  true 
negative  responses  are  indicated  by  a  double  asterisk(**). 
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1  -  Specificity 


Figure  37.  Receiver  Operating  Curve  for  wrist  ratio:  Cut-off  >.73  degrees 


1  -  Specificity 


Figure  38. 


Receiver  Operating  Curve  for 


Functional  Status  Scale:  Cut-off  >2.3  points 
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1  -  Specificity 


Figure  39.  Receiver  Opening  Curve  for  Fear  Avoidance  Behavior  Questionnaire  B:  Cut- 
off  >31  points 


1  *  Specificity 


Figure  40.  Receiver  Operating  Curve  for  involved  median  palmar  sensor,  latency!  Cut-off 
>.30  milliseconds 
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1  -  Specificity 


Figure  41.  Receiver  Operating  Curve  for  involved  median  motor  latency:  Cut-off  >5.0 
milliseconds 


1  -  Specificity 


Figure  42.  Receiver  Operating  Curve  for  involved  median  motor  amplitude:  Cut-off  <4800 
microvolts 
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1-“Most 


.40  (.00  -  .83) 
.40  (.00  -  .83) 
.25  (.00  -  .60) 


.55  (.33  -  .77) 
.45  (.23  -  .67) 
.98  (.91  -1.0) 


|_R+  95CI 


Table  26  Surgical  intervention  PosWv™  kebhood  Rato  (LR?) 

S^SSSS'^mcon^encein^aiaiSSCi, 

variable  I «"  95cl  Sp  ^  ^  ^ 

question 

bothersome  Sx  s. 

i.  Pain 

ii.  Numb/tingling 
jjj.  Loss  of  feeling 


.89  (.27  -  2.9) 
.73  (.23  -  2.3) 
10.5  (.49- 
206.1) 


question  2-“Where 
most  bothersome..” 

i.  Neck 

ii.  Shoulder/scap. 

iii.  Arm  AE 

iv.  Arm  BE 
v.  Hand  or  fingers 

question  3-“Sx. 
behavior..” 

i.  Constant 

ii.  Intermittent 

iii.  Variable 
question  4-“Hand 
fat/swollen..” 
question  5-“Fumble 
and  dropping.." 
question  6-“Entire  limb 
numb..” 

question  8-“Night  sx's. 
wake..” 

question  10-“Hand 
shaking  improves.” 
question  11 -“Worse 
with  hand  use..” 
Median  sensory  field  1 
Median  sensory  field  2 
Median  sensory  field  3 


92  (.70-1.0) 


.90  (.77-1-0) 
.75  (.56  -  .75) 
.95  (.85-1.0) 
.90  (.77  -  1 .0) 
.50  (.29  -  .50) 


1.8  (1.1- 3.0) 


.79  (.61  -  .97) 

.65  (.44  -  .86) 

.55  (.33  -  .77) 

* 

1.9  (.48-7.6) 

.57  (.09  -  3.6) 
.89  (.27  -  2.9) 

.50  (.28  -  .50) 

1.2  (.52-2.8) 

.80  (.25  -  2.6) 

.26  (.07  -  .45) 

.32  (.02  -  5.0) 

1.3  (.99-1.7) 

.65  (.44  -  .86) 

.92  (.42  -  2.0) 

1.1  (.33-3.9) 

.30  (.10  -  .50) 

1.3  (.38-4.7) 

.86  (.40  -  1 .9) 

.36  (.14-  .56) 

.23  (.02  -  3.5) 

1.3  (.91-1-9) 

.31  (.10  -  -51) 

.27  (.07-4.1) 

1 .4  (1 .0  -  1 .9) 

.80  (.45  -  1  -2)  .53  (.30-. 75)  .38  (.06 -2.3) 

60(17-10)  .61  (.39  -  84)  65  (.21  -  2.0) 

.50  (.01  -  .99)  68  (.48  -  69)  .73  (.26  -  2.0) 


1.7  (.89-3.2) 

1 .5  (.61  -  3.9) 

1.6  (.49-5.2) 
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Table  26  (cont’d) 


"MMT  abductor  pomcu^ 
brevis 
Tinel’s  A 
Tinel’s  B 

upper  limb  tension  test 
A 

upper  limb  tension  test 
B 

carpal  compression 
test 

Phalen’s  test 
wrist  ratio  (>.73) 
wrist  ratio  (>.70) 
hand  diagram 
Palmar  sensory 
latency  (>3.0) 

Motor  latency  (>  50) 
Motor  amplitude 
(<4800) 

Spontaneous  activity 
Functional  Status 
Scale  (>2.3) 

Fear  Avoidance 
Behavior 

Questionnaire  B 

(>  31  points) 
Symptom  duration  a 
(<78  days) 

Symptom  duration 
(>391  days) 


.84  (.68  -  1  0) 

.48  (.16  - 1-4) 

3.8  (1.2-13.4) 

.55  (-33  -  -77) 
.55  (.33  -  .77) 

1 .1  (.48  —  2.5) 

.36  (.06  -  2.2) 

.89  (.27  -  2.9) 
1.8  (.92-3.4) 

.29  (.08  -  .51) 

.68  (.10-4.6) 

1.1  (.66-1-9) 

.39  (.16-  .61) 

1.03  (.30-3.5) 

.98  (.44  -  2.2) 

.60  (•' 

,60  (.17 -1.0)  20  (.02  -.38)  2.0  (.50 -0.0) 

.,0  (.45 -1-2)  .26  (.07  -  46)  75  < 11  ^ 

.60  (.17-1-0)  .60  (62  -  98)  - 

,0  (45-1.2)  .45  (.23 -.67) 

.,0  (45-1.2)  .35  (.14 -.56) 


.44  (.07  -  2.7) 
.57  (.11  -3-6^ 


40  (.00- 83)  -8°  <62’-98)  75(35’16) 

40  (  00  -  83,  .85  (.69  -  1  -0) 

42  (.00  -  08)  .58(91-1.0)  W'* 

.60  (.17-1.0)  .95  (.86  -  1  -0)  MW''* 

.,0  (45-1.2)  .58  (.36 -  80) 


.35  (.06-2.1) 


.40 


44,21  68)  .19  (  01  -21)  1 U -°  -  2-7) 

.92  (.70 -1-1)  44  (.21  -  -68) 

9,(22-1  7)  2.7  (.65-11  0) 

.50,01-  99)  .81  (92-10)  62  (.22 


„  for  each  clinical  examination  variable  and 

The  hypothesized  levels  of  diagnosric  accuracy  f 

the  status  of  the  hypothesis  are  listed  in  Figures  44  -45 . 
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"^JiitiorTTo^Hand  shaking  improves..” 
wrist  ratio 
hand  diagram 

Nerve  conduction  and  electromyography 
findings 

Fear  Avoidance  Behavior  Questionnaire 
question  6-"Entire  limb  numb..” 


Accepted 

Accepted 

Accepted 

Accepted  for  all  parameters 

Accepted,  B  only 
Rejected 


.70  or  LR+  *  2.0  or  LR-  £  .50) 
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Variables 

symptoms..”  (i  (pain)  and  ii 
(numb/tingling)) 

'  question  2-“Where  most  bothersome^. 

(v  (hand/fingers)) 

'^estioiT3^Symptom  behavior^ 
(intermittent)  and  iii  (variable)) 

Tinel’s  A 

Carpal  compression  test 
Upper  limb  tension  test  B 
Visual  analog  scale 
question  4 

Sensation  (median  nerve  fields  2  and  3) 
CCT 

question  1-“Most  bothersome  symptoms 
(iii  (loss  of  feeling)) 

question  2-“Where  most  bothersome.." 
(all  response  levels) 
question  3-“Symptom  behavior.."  0 
(constant)) 

question  5-“Fumbling/dropping. .” 
question  10-“Hand  shaking  improves.." 
question  11 -“Worse  with  hand  use.." 
Sensation  (median  nerve  field  1) 
Manual  muscle  test  (abductor  pollicus 

brevis) 

Tinel’s  B 
Phalen’s  test 
Upper  limb  tension  test  A 
Symptom  Severity  Scale 
Functional  Status  Scale 


'Accepted' 


Accepted 

Accepted 

Accepted 

Accepted 

Accepted 

Accepted 

Accepted 

Rejected 

Rejected 

Rejected 

Rejected 

Rejected 

Rejected 

Rejected 

Rejected 

Rejected 

Rejected 

Rejected 

Rejected 

Rejected 
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surgical  CTS  group  -  ,o  the  small 
Patient  perception  of  change  was  not  as.s  charac, ensues  for  the  same 

number  of  subjects  who  renewed  surgery  ■  -  .  ^  using  the 

variables  assessed  in  the  surgical  rnterven  ^  revised  criterion  of  +/-  at 

remaining  20  CTS  subjects  who  did  not  renew  g  .  ^  ^ and  wotSened 
least  3  points  was  used  for  the  predictive  vai  t *  ,ests  were  computed  using 

subjects.  All  predictive  validity  characteristics 

dichotomous  ratings  only 


. .  d  nositive  for  the  condition  and  the 

The  five  subjects  who  improved  were  cons,  er  ^  ^  of  im[)roved  5ubjects 
remaining  15  subjects  served  as  the  contro  ^  ^  R0C  curve  analyses  and  are 
was  25%.  Optimum  cut-off  points  were  ten  ^  ^  ^  reveal  potentially  useful  cut- 
listed  in  Figures  46-52.  Receiver  opera  ing  Sworse>  palmar  latency, 

off  values  for  the  following  variables  and  area  under  the  curve  for  these 

F :ABQ  B’  characteristics  for  the  predictor  vanables  are 
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1  -  Specificity 

Figure  46.  Receiver  Operating  Curve  for  wrist  ratio:  Cut-off  <70. 


1  -  Specificity 


</>  o  oo  1 _ L 


0.00  25 

1  -  Specificity 


1.00 


Figure  48.  Receiver 


Operating  Curve  for  Symptom  Severity  Scale:  Cut-off  >3.0  points. 


1  -  Specificity 


Figure  49.  Receiver  Operating  Curve  for  visual  analog  scale  (now):  Cut-off  >3.4 
centimeters. 
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1  -  Specificity 


Figure  50.  Receiver  Operating  Curve  for  Fear  Avoidance  Behavior  Questionnaire  A.  Cut 
off  >54%. 


1  -  Specificity 


Figure  51.  Receiver  Operating  Curve  for  involved  median  palmar  sensory  amplitude: 
off  >43  microvolts 


Cut- 
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w  0.00 


0.00  25 

1  -  Specificity 


Figure  52.  Receiver  Operating  Curve  for  involved  median  motor  amplitude 
microvolts 


Cut-off  >8110 
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Table  27  Predictors  for  improvement  in  non-surgical  carpal  tunnel  sy^r<||T ^  Likelihood 
Sensitivity  (Sn),  Specificity  (Sp).  Negative  Likelihood  Rat.o  (LR-),  and  Posjive  L  kehhood 
Ratio  (LR+)  of  clinical  examination  variables  with  95%  confidence  in  e 

95CI  LR-  95CI  LR+  95CI 


Variable 

question  1-“Most 
bothersome  Sx’s. 

i.  Pain 

ii.  Numb/tingling 

iii.  Loss  of  feeling 
question  2-“Where 
most  bothersome..” 

i.  Neck 

ii.  Shoulder/scap. 

iii.  Arm  AE 

iv.  Arm  BE 

v.  Hand  or  fingers 
question  3-“Sx. 
behavior..” 

i.  Constant 

ii.  Intermittent 

iii.  Variable 


Sn  95CI  Sp 


.60  (.17 -1.0)  .60  (.35  -.85) 

.40  (.00 -.83)  .40  (.15 -.65) 

**  .99  (.95  -  1 .0)) 


.20  (.00 -.55)  .93  (.81  -1.0) 

.40  (.00 -.83)  .80  (.60 -1.0) 


.02  (.00 -.14)  .87  (.69 -1.0) 


1.5  (.58-3.9) 
.67  (.21-2.1) 


.20  (.00  -  .55) 
.20  (.00  -  .55) 
.60  (.17-1.0) 


.80  (.60  -1.0) 
.60  (.35  -  .85) 
.60  (.35  -  .85) 


3.0  (.23  -  39.6) 
2.0  (.46  -  8.8) 


.15  (.1.1-77.8) 


1.0  (.13-7.6 
.50  (.08  -  3.2) 
1.5  (.58-3.9) 


question  4-“Hand 
fat/swollen..” 

.92  (.70-1.0) 

.66  (.42  -  .89) 

.13  (.01  -  1.9) 

2.7  (1.3 -5.5) 

question  5-“Fumble 
and  dropping..” 

.80  (.45-  1.0) 

.27  (.94  -  .49) 

.75  (.11-5.2) 

1.1  (.64-1.9) 

question  6-“Entire 
limb  numb..” 

.80  (.45-  1.0) 

.80  (.60  -  1 .0) 

.25  (.04  -  .47) 

4.0  (1.3-12.1) 

question  8-“Night 
sx’s.  wake.." 

.80  (.45-  1.0) 

.33  (.09  -  .57) 

.60  (.09  -  4.0) 

1.2  (.68-2.1) 

question  10-“Hand 
shaking  improves." 

.80  (.45 -1.0) 

.40  (.15-  .65) 

.50  (.08  -  3.2) 

1.3  (.73-2.4) 

question  11 -“Worse 
with  hand  use..” 

.80  (.45  -  1.0) 

.33  (.09  -  .57) 

.60  (.09  -  4.0) 

1.2  (.68-2.1) 

Median  sensory  field 

.40  (-.03  -  .83) 

.50  (.24  -  .76) 

1.2  (.29-2.9) 

.80  (.24-2.6) 
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Table  27  (cont’d) 


Median  sensory  field 

2 

.60  (.17-1.0) 

.69  (.44  -  .94) 

.58  (.19-1.8) 

2.0  (.66  -  5.8) 

Median  sensory  field 

3 

.60  (.17-1.0) 

.79  (.57-  1.0) 

.51  (.51  -1.5) 

2.8  (.82-9.6) 

MMT  abductor 

pollicus  brevis 

.20  (.00  -  .55) 

.86  (.67-  1.0) 

.93  (.57-  1.5) 

1.4  (.16-12.3) 

Tinel’s  A 

.80  (.45  -  1.0) 

.67  (.43-  .91) 

.30  (.10-1.8) 

2.4  (1.0 -5.6) 

Tinel’s  B 

.60  (.17-1.0) 

.60  (.35  -  .85) 

.67  (.21  -2.1) 

1.5  (.58-3.9) 

upper  limb  tension 

test  A 

.75  (.33-1.0) 

.31  (.06  -  .56) 

.81  (.12-5.3) 

1.1  (.55-2.1) 

upper  limb  tension 

test  B 

.80  (.45-  1.0) 

.46  (.19-  .73) 

.43  (.07  -  2.8) 

1.5  (.76-2.9) 

carpal  compression 

test 

.92  (.70-  1.0) 

.28  (.06  -  .50) 

.30  (.02-4.7) 

1.3  (.96-1.9) 

Phalen’s  test 

.92  (.70-1.0) 

.33  (.12-  .61) 

.23  (.01  -3.5) 

1.5  (.92-2.3) 

wrist  ratio  low  (<.70) 

.80  (.45-1.0) 

.67  (.43  -  .91) 

.30  (.05  -  1.8) 

2.4  (1.0 -5.6) 

hand  diagram 

.60  (.17-1.0) 

.33  (.09  -  .57) 

1.2  (.33 -4.4) 

.90  (.40  -  2.0) 

palmar  sensory 
amplitude  (>4.3) 

.80  (.45  -  1 .0) 

.67  (.43-  .91) 

.30  (.05  -  1.8) 

2.4  (1.0 -5.6) 

motor  amplitude 

(>8110) 

.80  (.45-  1.0) 

.67  (.43  -  .91) 

.30  (.05  -  1.8) 

2.4  (1.0 -5.6) 

spontaneous  activity 

.60  (.17-1.0) 

.67  (.43  -  .90) 

.60  (.19-1.9) 

1.8  (.62-5.3) 

Functional  Status 

Scale  (>1.7) 

.92  (.70-  1.0) 

.85  (.63-  1.0) 

.10  (.01  -1.4) 

6.1  (1.4-27.5) 

Symptom  Severity 

Scale  (>3.0) 

Fear  Avoidance 

.92  (.70  -1.0) 

.66  (.42  -  .89) 

.13  (.01  -1.9) 

2.7  (1.3 -5.5) 

Behavior 

Questionnaire  A 

(>54%) 

.92  (.70-  1.0) 

.67  (.43-  .91) 

.03  (.00-13.8) 

2.9  (1.4 -6.1) 

visual  analog  scale 

now  (>3.4) 

.75  (.33-  1.0) 

.66  (.42  -  .89) 

.13  (.01-1.9) 

2.7  (1.3 -5.5) 

age  >40 

.92  (.71-1.0) 

.34  (.11  -  .58) 

.24  (.02-3.0) 

1.4  (.91  -2.1) 
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The  hypothesized  level  of  diagnostic  accuracy  for  each  clinical  examination  variable  and 
the  status  of  the  hypothesis  is  listed  in  Figures  53  -  54. 


Variables  Hypothesis  status 

Accepted 


question  6-“Entire  limb 
numb..” 

question  10-“Hand  shaking 

improves..” 

wrist  ratio 

Nerve  conduction  and 
electromyography  findings 
Fear  Avoidance  Behavior 
Questionnaire 
hand  diagram 


Accepted 

Accepted 

Accepted,  all  parameters 
except  spontaneous  activity 

Accepted,  A  only 
Rejected 


Figure  53.  Hypothesis:  Carpal  tunnel  syndrome  clinical  examination  variables  will 
demonstrate  an  acceptable  level  of  predictive  validity  for  improvement  (Sn  or  Sp  >  .70  or 
LR+  >  2.0  or  LR-  <  .50) 
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Variables  Hypothesis  status 

question  1-“Most  Accepted 

bothersome  symptoms  (i 
(pain)  and  iii  (loss  of 
feeling)), 


question  2-“Where  most 

bothersome..’’  (iii  (arm  AE) 

and  iv  (arm  BE)) 

Accepted 

question  3-“Symptom 

Accepted 

behavior..”  (ii  (intermittent) 

and  iii  (variable)) 

Tinel’s  B 

Accepted 

question  4-“Hand 

Rejected 

fat/swollen..” 

question  5- 

Rejected 

“Fumbling/dropping..” 

question  8-“Night  symptoms 

Rejected 

wake..” 

question  10-“Hand  shaking 

Rejected 

improves..” 

question  1 1 -“Worse  with 

Rejected 

hand  use..” 

Sensation 

Rejected 

Manual  muscle  test 

Rejected 

(abductor  pollicus  brevis)  6 

Tinel’s  A 

Rejected 

Carpal  compression  test 

Rejected 

Phalen’s  test 

Rejected 

Upper  limb  tension  test  A 

Rejected 

Upper  limb  tension  test  B 

Rejected 

Visual  analog  scale 

Rejected 

Symptom  Severity  Scale 

Rejected 

Functional  Status  Scale 

Rejected 

Table  54.  Hypothesis:  Carpal  tunnel  syndrome  clinical  examination  variables  will  not 
demonstrate  an  acceptable  level  of  predictive  validity  for  improvement 
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6.4.2.2.2  Worsened  CTS  Subjects 

The  seven  subjects  who  worsened  were  considered  positive  for  the  condition  and  the 
remaining  15  subjects  served  as  the  control  group.  The  prevalence  of  worsened  subjects 
was  35%.  All  predictive  validity  characteristics  of  provocative  tests  were  computed 
using  dichotomous  ratings  only.  Optimum  cut-off  points  were  identified  using  ROC 
curve  analyses  and  are  listed  in  Figures  55  -  58.  Receiver  operating  curves  did  not 
reveal  potentially  useful  cut-off  values  for  the  following  variables  and  are  not  shown: 
VASnow,  FABQ  A,  B  and  Total,  median  palmar  sensory  latency  and  amplitude,  median 
motor  latency  latency  and  amplitude.  The  area  under  the  curve  for  each  of  these 
variables  was  less  than  .47  except  for  VASnow.  The  VASnow  variable  had  an  area  under 
the  curve  of.69,  which  was  less  than  the  VAS worse  area  of  .87  and  therefore  was  not 
included.  The  predictive  validity  characteristics  for  the  predictor  variables  are  shown  in 
Table  28. 


1  -  Specificity 


Figure  55.  Receiver  Operating  Curve  for  wrist  ratio  :  Cut-off  >.70. 
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1  -  Specificity 


Figure  56.  Receiver  Operating  Curve  for  Symptom  Severity  Scale  :  Cut-off  >2.0  points 


1  -  Specificity 


Figure  57.  Receiver  Operating  Curve  for  Functional  Status  Scale  :  Cut-off  >2.0  points 
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Table  28  Predictors  of  worsening  in  non-surgical  carpal  tunnel  syndrome  subjects: 
Sensitivity  (Sn),  Specificity  (Sp),  Negative  Likelihood  Ratio  (LR-),  and  Positive  Likelihood 
Ratio  (LR+)  of  clinical  examination  variables  with  95%  confidence  intervals  (95C1) 


Variable 

Sn  95CI 

Sp  95CI 

LR-  95CI 

LR+  95CI 

question  1-“Most 

bothersome  Sx’s. 

i.  Pain 

.43  (.06  -  .80) 

.54  (.27  -  .54) 

* 

.93  (.33-2.6) 

ii.  Numb/tingling 

.57  (.20  -  .94) 

.46  (.19 -.46) 

1.1  (.47-2.4) 

iii.  Loss  of  feeling 

question  2-“Where 

most  bothersome..” 

*-* 

** 

i.  Neck 

** 

.77  (.54-1.0) 

ii.  Shoulder/scap. 

.29  (.00-  .62) 

.77  (.54  -  1 .0) 

* 

1.2  (.27-5.8) 

iii.  Arm  AE 

.14  (.00-  .46) 

.96  (.87-  1.0) 

5.3  (.24-114.3) 

iv.  Arm  BE 

.31  (.00  -  .63) 

96  (.87-1.0) 

8.8  (.48-160.5) 

v.  Hand  or  fingers 

question  3-“Sx. 

behavior..” 

.29  (.00  -  .62) 

.38  (.12 -.65) 

.46  (.13-1.6) 

i.  Constant 

.01  (.00 -.10) 

.69  (.44  -  .94) 

* 

.05  (.00-22.7) 

ii.  Intermittent 

.43  (.06  -  .80) 

.69  (.44  -  .94) 

1.4  (.43-4.5) 

iii.  Variable 

.57  (.20  -  .94) 

.62  (.35  -  .88) 

1.5  (.58-3.8) 

question  4-“Hand 

fat/swollen..” 

.57  (.20  -  .94) 

.54  (.27-  .81) 

.80  (.30-2.2) 

1.2  (.52-3.0) 

question  5-“Fumble 

and  dropping..” 

.94  (.39  -  .90) 

.39  (.14-  .65) 

.16  (.01  -2.5) 

1.6  (.98-2.4) 

question  6-“Entire 

limb  numb..” 

.29  (.00  -.62) 

.62  (.35  -  .88) 

1.2  (.61  -2.2) 

.74  (.19-2.9) 

question  8-“Night 

sx’s.  wake..” 

.83  (.54-1.0) 

.43  (.17-  .69) 

.39  (.06  -  2.6) 

1.5  (.82-2.6) 

question  10-“Hand 

shaking  improves.” 

.86  (.46  -  .60) 

.46  (.19-  .73) 

.31  (.05-2.1) 

1.6  (.88-2.9) 

question  11 -“Worse 

with  hand  use..” 

.71  (.38-1.0) 

.31  (.06  -  .56) 

.93  (.22  -  3.9) 

1.0  (.57-1.9) 

Median  sensory 

field  1 

.50  (.10  -.90) 

.54  (.27 -.81) 

.93  (.36-2.4) 

1.1  (.40-2.9) 

Median  sensory 

field  2 

.40  (.03  -  83) 

.62  (.35-  .88) 

.97  (.42  -  2.3) 

1.0  (.29-3.7) 
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Table  28  (cont’d) 


Median  sensory 

field  3 

.17  (.13 -.46) 

.62  (.35  -  .88) 

1.4  (.77-2.4) 

.43  (.06-3.0) 

MMT  abductor 

pollicus  brevis 

.17  (.13  -  .46) 

.85  (.65-  1.0) 

.98  (.64-1.5) 

1.1  (.12-9.8) 

Tinel’s  A 

.29  (.00  -  .62) 

.46  (.19  -  .73) 

1.6  (.73  -3.28 

.53  (.15-1.9) 

Tinel’s  B 

.43  (.06  -  .80) 

.54  (.27-  .81) 

1.1  (.47-2.4) 

.93  (.33-2.6) 

upper  limb  tension 

test  A 

.80  (.45-  1.0) 

.33  (.07  -  .60) 

.60  (.09-4.1) 

1.2  (.66-2.2) 

upper  limb  tension 

test  B 

.80  (.45-  1.0) 

.46  (.19-  .73) 

.43  (.07  -  2.8) 

1.5  (.76-2.9) 

carpal 

compression  test 

.86  (.46  -  .60) 

.23  (.00  -  .46) 

.62  (.08  -  4.9) 

1.3  (.67-2.6) 

Phalen’s  test 

.93  (.74-1.0) 

.39  (.14-  .38) 

.18  (.01-2.8) 

1.5  (.96-2.4) 

wrist  ratio  high 
(>.70) 

.86  (.60-1.0) 

.62  (.35  -  .88) 

.23  (.04-1.5) 

2.2  (1.1  -4.72) 

hand  diagram 

.86  (.60-1.0) 

.46  (.19 -.73) 

.31  (.05-2.1) 

1.6  (.88-2.9) 

Spontaneous 

activity 

.14  (.00  -  .40) 

.92  (.78-  1.0) 

.93  (.22  -  3.9) 

1.9  (.14-25.4 

Functional  Status 

Scale  (>2.0) 

.86  (.60 -1.0) 

.62  (.35  -  .88) 

.23  (.04  -  .50) 

2.2  (1.1  -4.7) 

Symptom  Severity 
Scale  (>2.0) 

.94  (.77-1.0) 

.32  (.08  -  .57) 

.19  (.01  -3.2) 

1.4  (.92-2.1) 

Visual  analog  scale 

worse  (>4.9) 

.86  (.00-1.0) 

.83  (.62-1.0) 

.17  (.03-1.1) 

5.1  (1.4-18.9) 

The  hypothesized  level  of  diagnostic  accuracy  for  each  clinical  examination  variable  and 
the  status  of  the  hypotheses  are  listed  in  Figures  59  -  60. 
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Variables 

Hypothesis  status 

question  10-“Hand  shaking  improves..” 

Accepted 

wrist  ratio 

Accepted 

hand  diagram 

Accepted 

Nerve  conduction  and 

Accepted:  Spontaneous 

electromyography  findings 

activity  in  abductor  pollicus 

brevis 

Rejected:  all  other  parameters 

question  6-“Entire  limb  numb..” 

Rejected 

Fear  Avoidance  Behavior  Questionnaire 

Rejected 

Figure  59.  Hypothesis:  Carpal  tunnel  syndrome  clinical  examination  variables  will 
demonstrate  an  acceptable  level  of  predictive  validity  for  worsening  (Sn  or  Sp  2 .70  or  LR+ 
>  2.0  or  LR-  <  .50) 
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Variables 

Hypothesis  status 

question  1 -‘‘Most  bothersome 

symptoms.. ”(i  (pain),  ii  (numb/tingling), 

iii  (loss  of  feeling)) 

Accepted 

question  2  (v  (hands/fingers)) 

Accepted 

question  3-“Symptom  behavior..”  (i 

(constant),  ii  (intermittent),  iii  (variable)) 

Accepted 

question  4-“hand  fat/swollen..” 

Accepted 

Sensation  (all  median  nerve  fields) 

Accepted 

Tinel’s  A 

Accepted 

TineFs  B 

Accepted 

questions  5-“Fumbling/dropping..” 

Rejected 

question  8-“Night  symptoms  wake..” 

Rejected 

question  10-“Hand  shaking  improves..” 

Rejected 

question  1 1 -“Worse  with  hand  use..” 

Rejected 

Manual  muscle  test  (abductor  pollicus 

brevis) 

Rejected 

Carpal  compression  test 

Rejected 

Phalen’s  test 

Rejected 

Upper  limb  tension  test  A 

Rejected 

Upper  limb  tension  test  B 

Rejected 

Visual  analog  scale 

Rejected 

Symptom  Severity  Scale 

Rejected 

Functional  Status  Scale 

Rejected 

Figure  60  Hypothesis:  Carpal  tunnel  syndrome  clinical  examination  variables  will  not 
demonstrate  an  acceptable  level  of  predictive  validity  for  worsening 


In  summary,  Tables  29  through  31  lists  the  variables  with  acceptable  predictive 
Likelihood  ratios  for  surgical  intervention  and  for  change  in  non-surgically  treated  CTS 
subjects. 
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Table  29.  Acceptable  Likelihood  ratios  of  surgical  predictors.  (LR+  2  2.0  or  LR-  <  .50) 


Category/Test  item 

Negative  Likelihood  Ratio 

Positive  Likelihood  Ratio 

Questions 

question  1-“Most  bothersome 

symptoms.."  (iii  (loss  of  feeling)) 

-- 

10.5 

question  5-“Fumbling/ 
dropping..” 

.32 

- 

question  10-“Hand  shaking 
improves..” 

.23 

- 

question  11 -“Worse  with  hand 

use..” 

.27 

-- 

Neurologic  Examination 

median  nerve  field  1 

.38 

- 

abductor  pollicus  brevis  muscle 

test 

.48 

3.8 

Self-Report 

Functional  Status  Scale 

!  .35 

~ 

Provocative  Tests  &  Measures 

Tinel’s  B 

.36 

- 

wrist  ratio  >.73 

.50 

3.0 

wrist  ratio  >.70 

.44 

- 

EMG/NCS  (median  nerve 
parameters;  ms=milliseconds; 
uv=  microvolts) 

palmar  sensory  latency  (>3.0ms) 

2.0 

motor  latency  (>5.0ms) 

-- 

2.7 

Motor  amplitude  (<4800uv) 

- 

17.5 

Spontaneous  activity  (abductor 
pollicus  brevis  muscle) 

.42 

11.4 

Additional  Predictors 

Symptom  duration  (<78  days) 

.19 

- 

Symptom  duration  (>391  days) 

-- 

2.7 
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Table  30.  Acceptable  Likelihood  ratios  of  improved  predictors.  (LR+  £  2.0  or  LR-  S  .50) 


Category/Test  item 


Negative  Likelihood  Ratio  Positive  Likelihood  Ratio 


3.0 


Questions 

question  2-“Where  most 
bothersome..”  (i  (neck)) 
question  2-“Where  most 
bothersome. .”(ii  (shoulder, 
shoulder  blade)) 
question  4-“Hand  fat/swollen..” 
question  6-“Entire  limb  numb..” 
Neurologic  Examination 

median  nerve  field  2 
median  nerve  field  3 
Self-Report 

Functional  Status  Scale  (>.1.7) 

Symptom  Severity  Scale(>3.0) 

Fear  Avoidance  Behavior 

Questionnaire  A  (>54%) 

Visual  analog  scale(now)  (>3.4) 

Provocative  Tests  &  Measures 

Tinel’s  A 

upper  limb  tension  test  B 

Carpal  compression  test 

Phalen’s  test 

wrist  ratio  >.70 

EMG/NCS  (median  nerve 
parameters;  ms=milliseconds;  uv= 
microvolts) 

palmar  sensory  amplitude  (>43uv) 
motor  amplitude  (>8100uv) 


“ 

2.0 

.13 

2.7 

.25 

4.0 

2.0 

2.8 

.10 

6.1 

.13 

2.7 

.13 

2.9 

.13 

2.7 

.30 

2.4 

.43 

-- 

.30 

- 

.23 

- 

.30 

2.4 

.30 

2.4 

.30 

2.4 
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Table  31.  Acceptable  Likelihood  ratios  of  worsened  predictors.  (LR+  2  2.0  or  LR-  S  .50) 


Category/Test  item 

Negative  Likelihood  Ratio 

Positive  Likelihood  Ratio 

Questions 

question  2-“Where  most 

bothersome..”  (iii  (arm  AE)) 

-- 

5.3 

question  2-“Where  most 

bothersome..”  (arm  BE  (iv)) 

-- 

8.8 

question  5-“Fumbling/dropping..” 

.16 

- 

question  8-“Night  symptoms 

wake..” 

.39 

- 

question  10-“Hand  shaking 

improves..” 

.31 

- 

Self-Report 

Functional  Status  Scale  (>2.0) 

.23 

2.2 

Symptom  Severity  Scale(>2.0) 

.19 

-- 

Visual  analog  scale(now)  (>4.9) 

.17 

5.1 

Hand  diagram  (1  or  2  or3) 

.16 

-- 

Provocative  Tests  &  Measures 

upper  limb  tension  test  B 

.43 

- 

Phalen’s  test 

.18 

- 

wrist  ratio  >.70 

.23 

2.2 

6.5  Hypothesis  #4  -  Test  Item  Cluster  (TIC) 

A  binary  logistic  regression  model  was  used  solely  as  a  variable  reduction  method  to 
identify  the  most  accurate  and  parsimonious  TIC  for  all  diagnostic  tests  and  their 
respective  dependent  variable  conditions.23  Duration  of  symptoms  was  eliminated  from 
the  model  because  duration  of  symptoms  was  missing  for  20%  -  25  %  of  subjects  in  this 
analysis.  Due  to  the  large  number  of  predictor  variables,  only  variables  with  an  LR+ 
point  estimate  >  2.0  or  an  LR-  point  estimate  of  <  .50  were  entered  in  to  the  regression 
model.  This  preliminary  variable  reduction  method  represents  a  change  from  the  criteria 
originally  specified  due  to  the  wide  95CI’s  for  the  test  variable  LR’s  in  this  study.  All 
continous  measures  were  entered  into  the  model  as  dichotomized  variables  based  on 
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previously  established  cut-off  values.  However,  variables  that  had  multiple  unique-level 
response  items  (questions  1-3)  were  not  transformed  prior  to  entering  the  model. 

The  Hosmer-Lemeshow  (HL)  summary  goodness-of-fit  statistic  was  used  to  assess  the  fit 
of  the  model  to  the  data.  The  HL  test  is  based  on  the  distance  between  the  observed  and 
fitted  values,  is  well  approximated  by  the  Pearson  Chi-square  distribution  with  dfi=  g  -  2 
(g  is  the  percentile-type  grouping  of  observed  and  fitted  values,  with  g  usually  equal  to 
10  groups),  and  provides  a  single,  easily  interpretable  value  which  can  be  used  to  assess 
model-data  fit.  The  HL  tests  the  hypothesis  that  the  model  fits  the  data.  Therefore, 
higher  values  of  p  indicate  a  better  fit.272pp  142 ' 144  Because  the  purpose  of  the  logistic 
regression  model  was  strictly  variable  reduction,  analyses  of  the  individual  residuals  and 

23 

additional  diagnostic  statistics  of  the  fitted  model  were  not  performed. 

6,5,1  Diagnostic  Characteristics  of  the  CR  TIC 

For  the  CR  TIC,d  backward  step-wise  selection  was  used  to  enter  variables  into  the 
logistic  regression  model  with  p  values  of .  1  and  .  1 5  to  enter  and  exit  the  model, 
respectively.  The  method  of  entry  and  liberal  p-values  were  chose  in  order  to  prevent 
potentially  useful  variables  from  being  excluded  from  the  model.273.  The  following  test 
variables  were  entered  into  the  regression  model  as  predictors  for  CR:  ULTTA,  question 
7-“Symptoms  keep  from  falling  asleep..”,  question  9-“Neck  movement  improves..”, 
Valsalva,  biceps  brachii  MSR,  Distraction,  VASworse,  Spurling  A,  question  2,  MMT 
biceps,  and  MMT  deltoid.  After  list-wise  deletion,  a  total  of  60  subjects  were  used  in  the 
analysis.  The  results  of  the  HL  test  indicated  the  model  fit  the  data  (final  p=  .93).  The 
four  test  variables  chosen  by  the  model  and  therefore  considered  the  best  CR  TIC 
include:  question  9,  Valsalva,  biceps  brachii  MSR,  and  Distraction.  The  diagnostic 
characteristics  of  the  CR  TIC  were  calculated  as  for  other  dichotomous  variables  using  a 
typical  2X2  contingency  table.  When  a  zero  cell  value  was  encountered,  .5  was  added 
to  all  cell  values  in  the  table.  Three  different  criterion  levels  for  a  positive  test  were 
established  based  on  number  of  positive  findings  for  variables  in  the  TIC.  Table  32  lists 
the  diagnostic  characteristics  of  the  CR  TIC.  Values  that  were  considered  acceptable  are 
in  bold. 
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Table  32  Test  item  cluster  for  the  diagnosis  of  cervical  radiculopathy  :  question  9-‘‘Neck 
movement  improves..”,  Valsalva,  Biceps  brachii  muscle  stretch  reflex,  and  distraction. 
Sensitivity  (Sn),  Specificity  (Sp),  Negative  Likelihood  Ratio  (LR-),  and  Positive  Likelihood 
Ratio  (LR+)  of  clinical  examination  variables  with  95%  confidence  intervals  (95CI) 


Number  of  any 

positive  findings 

in  cluster 

Sn  95CI 

Sp  95CI 

LR-  95CI 

LR+  95CI 

One 

.85  (.65-  1.0) 

.57  (.44  -  .69) 

.27  (.07  -.99) 

2.0  (1 .3  -  2.8) 

Two 

.54  (.27-  81) 

.93  (.87-  .1.0) 

.50  (.27  -  .89) 

8.1  (2.8  -  23.6) 

Three 

.25  (.01  -  .48) 

.99  (.97-1.0) 

.76  (.56-1.0) 

30.5  (1.7-557.5) 

The  hypothesis  that  a  combination  of  clinical  examination  variables  and/or  patient  self- 
report  items  can  be  found  to  yield  acceptable  levels  of  diagnostic  accuracy  for  CR  is 
accepted. 

The  hypothesis  regarding  the  predictive  validity  of  a  TIC  for  surgical  intervention  and 
improvement  in  subjects  CR  could  not  be  tested  due  to  the  absence  of  surgical 
intervention  and  the  low  prevalence  of  CR  subjects. 

6  5.2  CTS  TIC’s 

Predictor  variables  for  the  CTS  diagnosis  TIC  were  entered  into  the  regression  model 
using  the  same  variable  entry  procedure  described  for  the  CR  TIC.  However,  all  the 
predictive  validity  CTS  TIC’s  still  had  a  large  number  of  variables  using  the  initial 
variable  reduction  criteria.  Therefore,  the  following  revised  preliminary  variable 
reduction  procedure  and  criteria  were  used:  1  Only  variables  with  an  LR+  of  >  3.0  or  an 
LR-  value  of  <  .30  were  entered  in  the  model;  2.  Variables  with  both  LR+  and  LR- 
values  that  met  the  previous  inclusion  criteria  of  >  2.0  or  <  .50,  respectively,  were 
transformed  to  the  same  scale  by  taking  the  natural  log  of  both  LR’s.  The  natural  log 
values  were  then  summed  to  a  single  LRprime  value  and  variables  with  an  LRprime 
value  of  1 .2  were  included  in  the  model.  An  LR  prime  value  of  1 .2  is  equivalent  to  LR+ 
and  LR-  values  of  .30  and  3.4,  respectively.  For  predictive  validity  TIC’s,  the  additional 
predictor  variables  previously  considered  were  also  assessed,  which  included:  FABQA 
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score,  FABQB  score,  palmar  sensory  latency,  motor  latency;  palmar  sensory  amplitude, 
motor  amplitudes,  and  the  EMG  spontaneous  activity  rating  of  the  abductor  pollicus 
brevis. 

6.5.2. 1  Diagnostic  Characteristics  of  the  CTS  diagnosis  TIC 
The  following  test  variables  were  entered  into  the  regression  model:  FSS  (>2.5), 
question  10-“Hand  shaking  improves..”,  SSS  (>3.2),  question  3-“Symptom  behavior..”, 
question  5-‘Tumbling/dropping.”,  and  MMT  abductor  pollicus  brevis.  After  list-wise 
deletion,  a  total  of  73  subjects  were  used  in  the  analysis.  The  results  of  the  HL  test 
indicated  the  model  fit  the  data  (final  p=  .77).  The  four  test  variables  chosen  by  the 
model  and  therefore  considered  the  best  CTS  TIC  include:  questions  3-“Symptom 
behavior..”,  and  10-“Hand  shaking  improves..”,  SSS,  and  MMT  abductor  pollicus  brevis. 
Question  3  was  dichotomized  into  positive  (i  (constant))  and  negative  (ii  (intermittent) 
and  iii  (variable))  responses.  The  diagnostic  characteristics  of  the  CTS  TIC  were 
calculated  as  for  other  dichotomous  variables  using  a  typical  2  X  2  contingency  table.  As 
with  the  CR  diagnosis  TIC,  three  different  criterion  levels  for  a  positive  test  were 
established  based  on  number  of  positive  findings  for  variables  in  the  TIC.  Table  33  lists 
the  diagnostic  characteristics  of  the  CTS  diagnosis  TIC 


Table  33.  Test  item  cluster  for  the  diagnosis  of  carpal  tunnel  syndrome  :  question  10- 
“Hand  shaking  improves..”,  question  3-“Symptom  behavior..’’,  Symptom  Severity  Scale 
(>3.2),  and  abductor  pollicus  brevis  muscle  test.  Sensitivity  (Sn),  Specificity  (Sp),  Negative 
Likelihood  Ratio  (LR-),  and  Positive  Likelihood  Ratio  (LR+)  of  clinical  examination 
variables  with  95%  confidence  intervals  (95CI) 


Number  of  any 

positive  findings  in 

cluster 

Sn  95CI 

Sp  95CI 

LR-  95CI 

LR+  95CI 

One 

.94  (.84-  1.0) 

.47  (.32-  .61) 

.14  (.04  -  .55) 

1.7  (1.3 -2.4) 

Two 

.45  (.28  -  .63) 

.82  (.79  -  .93) 

.67  (.47  -  .95) 

2.5  (1.2  -  5.2) 

Three 

.16  (.03-  .29) 

.98  (.93-1.0) 

.86  (.73-  1.0) 

7.1  (.87  -  57.8) 
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The  hypothesis  that  a  combination  of  clinical  examination  variables  and/or  patient  self- 
report  items  can  be  found  to  yield  acceptable  levels  of  diagnostic  accuracy  for  CTS  is 
accepted. 

6. 5, 2, 2  Diagnostic  Characteristics  of  the  CTS  Surgery  TIC 

The  following  test  variables  were  entered  into  the  regression  model:  questions  l-‘Most 
bothersome  symptoms..”,  2-“Where  most  bothersome..”,  10-“Hand  shaking  improves..”, 

1  l-“Worse  with  hand  use..”,  5-‘Tumbling/dropping..”,  MMT  abductor  pollicus  brevis, 
and  wrist  ratio(>  .73).  After  list-wise  deletion,  a  total  of  25  subjects  were  used  in  the 
analysis.  The  results  of  the  HL  test  indicated  the  model  fit  the  data  (final  p=  .99).  The 
four  test  variables  chosen  by  the  model  and  therefore  considered  the  best  CTS  TIC 
include:  questions  l-“Most  bothersome  symptoms.”  and  2-“Where  most  bothersome..”, 
MMT  abductor  pollicus  brevis,  and  wrist  ratio  (>.73).  Question  1  was  dichotomized  into 
positive  (iii  (loss  of  feeling))  and  negative  (i  (pain)  and  ii  (numb/tingling))  responses. 
Question  2  was  also  dichotomized  into  positive  (v  (hand/fingers))  and  negative  (i  (neck), 
ii  (shoulder/shoulder  blade),  iii  (arm  AE),  and  iv  (arm  BE))  responses.  The  diagnostic 
characteristics  of  the  CTS  TIC  were  calculated  as  for  other  dichotomous  variables  using  a 
typical  2X2  contingency  table.  Three  different  criterion  levels  for  a  positive  test  were 
established  based  on  number  of  positive  findings  for  variables  in  the  TIC.  Table  34  lists 
the  diagnostic  characteristics  of  the  CTS  surgical  TIC. 


Table  34  Test  item  cluster  for  carpal  tunnel  syndrome  surgical  predictors:  question  1- 
“Most  bothersome  symptoms..",  question  2-“Where  most  bothersome..”,  MMT  abductor 
pollicus  brevis,  and  wrist  ratio  (>.73).  Sensitivity  (Sn),  Specificity  (Sp),  Negative  Likelihood 
Ratio  (LR-),  and  Positive  Likelihood  Ratio  (LR+)  of  clinical  examination  variables  with  95% 
confidence  intervals  (95CI) 


Number  of  any 

positive  findings  in 

cluster 

Sn  95CI 

Sp  95CI 

LR-  95CI 

LR+  95CI 

One 

.92  (.70-  1.0) 

.31  (.11  -  .51) 

.27  (.02-4.1) 

1.3  (.91  -  1.9) 

Two 

.92  (.70-  1.0) 

.83  (.99  -  .83) 

.10  (.01  -1.4) 

5.5  (2.1  -  14.7) 

Three 

.42  (.92  -  .81) 

.98  (.91  -1.0) 

.60  (.30-1.0) 

17.5  (.97-317.3) 
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The  hypothesis  that  a  combination  of  clinical  examination  variables  and/or  patient  self- 
report  items  can  be  found  to  yield  acceptable  levels  of  diagnostic  accuracy  for  CTS  is 
accepted. 

6.5. 2, 3  Diagnostic  Characteristics  of  the  CTS  Improved  TIC 
After  preliminary  variable  reduction,  the  following  test  variables  met  the  criteria  and 
were  initially  entered  into  the  regression  model:  FSS  (>1.7),  FABQA  (>54%),  questions 
4-“Hand  fat/swollen..”  and  6-“Entire  limb  numb..”,  CCT,  wrist  ratio  (<  70),  Phalen’s  test, 
TinelA,  palmar  sensory  amplitude  (>4.3),  motor  amplitude  (>8110),  question  2-“Where 
most  bothersome..”,  and  VASnow  (>3.4).  Due  to  the  large  number  of  variables,  the  co- 
variance  matrix  of  the  regression  model  could  not  be  calculated.  The  model  was  re¬ 
calculated  after  one  variable  at  a  time  was  removed.  Variables  were  removed  from  the 
model  based  on  the  lowest  LRprime  value.  After  a  total  of  six  iterations,  a  solution  to 
the  model  was  reached  with  the  following  variables  entered  into  the  model:  FSS, 

FABQA,  questions  4-“Hand  fat/swollen..”  and  6-“Entire  limb  numb..”,  CCT,  and  wrist 
ratio  (<70).  No  missing  data  were  identified  by  list-wise  deletion  and  all  20  subjects 
were  used  in  the  analysis.  The  results  of  the  HL  test  indicated  the  model  fit  the  data 
(final  p=  1.0).  The  three  test  variables  chosen  by  the  model  and  therefore  considered  the 
best  CTS  TIC  include:  question  4,  FABQA  (>54%),  and  wrist  ratio  (<.70).  The 
diagnostic  characteristics  of  the  CTS  TIC  were  calculated  as  for  other  dichotomous 
variables  using  a  typical  2X2  contingency  table.  Three  different  criterion  levels  for  a 
positive  test  were  established  based  on  number  of  positive  findings  for  variables  in  the 
TIC.  Table  35  lists  the  diagnostic  characteristics  of  the  CTS  improved  TIC. 
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Table  35  Test  item  cluster  for  predictors  of  improvement  of  non-surgical  patients  with 
carpal  tunnel  syndrome  :  question  4-“Hand  fat/swollen..”,  Fear  Avoidance  Behavior 
Quesionaire  A  (>54%),  and  wrist  ratio  (<.70).  Sensitivity  (Sn),  Specificity  (Sp),  Negative 
Likelihood  Ratio  (LR-),  and  Positive  Likelihood  Ratio  (LR+)  of  clinical  examination 
variables  with  95%  confidence  intervals  (95CI) 


Number  of  any 

positive  findings  in 

cluster 

Sn  95CI 

Sp  95CI 

LR-  95CI 

LR+  95CI 

One 

.92  (.70-  1.0) 

.34  (.11  -  .34) 

.24  (.02  -  3.8) 

1.4  (.91-2.2) 

Two 

.92  (.70-1.0) 

.66  (90  -  .66) 

.13  (.01  -1.8) 

2.7  (1.3 -5.5) 

Three 

.75  (.40-1.0) 

.97  (.88-1.0) 

.26  (.06-1.0) 

24.0  (1.5-381.9) 

The  hypothesis  that  a  combination  of  clinical  examination  variables,  and/or  patient  self- 
report  items,  and/or  EMG/NCS  findings  can  be  found  to  yield  acceptable  levels  of 
predictive  validity  for  improved  non-surgical  CTS  subjects  is  accepted. 

6.5, 2, 4  Diagnostic  Characteristics  of  the  CTS  Worsened  TIC 
After  preliminary  variable  reduction,  the  following  test  variables  met  the  criteria  and 
were  initially  entered  into  the  regression  model:  Phalen’s  test,  Hand  diagram,  questions 
2-“Where  most  bothersome..”,  5-‘Tumbling/dropping..”,  4-“Hand  fat/swollen..”,  10- 
‘Tland  shaking  improves..”,  8-“Night  symptoms  wake..”,  SSS  (>2.0)  wrist  ratio  (>.70), 
and  FSS  (>2.0).  Due  to  the  large  number  of  variables,  the  co-variance  matrix  of  the 
regression  model  could  not  be  calculated.  The  model  was  re-calculated  after  one  variable 
at  a  time  was  removed.  Variables  were  removed  from  the  model  based  on  the  lowest 
LRprime  value.  After  a  total  of  five  iterations,  a  solution  to  the  model  was  reached  with 
the  following  variables  entered  into  the  model:  questions  2  and  5,  hand-diagram  score, 
wrist  ratio  (>.70),  and  SSS  (>2.0).  The  results  of  the  HL  test  indicated  the  model  fit  the 
data  (final  p=  1.0).  The  three  test  variables  chosen  by  the  model  and  therefore  considered 
the  best  CTS  TIC  include:  questions  2,  5  and  wrist  ratio  (>.70).  The  diagnostic 
characteristics  of  the  CTS  TIC  were  calculated  as  for  other  dichotomous  variables  using  a 
typical  2X2  contingency  table.  Based  on  the  LR’s  for  question  2 -“Where  most 
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bothersome..”  for  the  individual  predictors  of  worsening,  the  question  was  dichotomized 
into  positive  (iii  (arm  AE)  and  iv  (arm  BE))  and  negative  (i  (neck),  ii  (shoulder/shoulder 
blade),  and  v  (hand/fingers))  responses.  Three  different  criterion  levels  for  a  positive  test 
were  established  based  on  number  of  positive  findings  for  variables  in  the  TIC.  Table  36 
lists  the  diagnostic  characteristics  of  the  CTS  worsened  TIC. 


Table  36  Test  item  cluster  for  predictors  of  worsened  non-surgical  patients  with  carpal 
tunnel  syndrome  :  question  2-“Where  most  bothersome..”,  question  5- 
“Fumbling/dropping..",  and  wrist  ratio  (>.70).  Sensitivity  (Sn),  Specificity  (Sp),  Negative 
Likelihood  Ratio  (LR-),  and  Positive  Likelihood  Ratio  (LR+)  of  clinical  examination 
variables  with  95%  confidence  intervals  (95CI) 


Number  of  any 

positive  findings  in 

cluster 

Sn  95CI 

Sp  95CI 

LR-  95CI 

LR+  95CI 

One 

.94  (.77-1.0) 

.18  (.00-  .38) 

.35  (.02-6.4) 

1.1  (.84-1.6) 

Two 

.94  (.77-1.0) 

.75  (.52  -  .97) 

.08  (.01  -1.2) 

3.8  (1 .4  -  9.5) 

Three 

.31  (.00-6.3) 

.96  (.87-1.0) 

.71  (.44-1.2) 

8.8  (.48-160.5) 

The  hypothesis  that  a  combination  of  clinical  examination  variables,  and/or  patient  self- 
report  items,  and/or  EMG/NCS  findings  can  be  found  to  yield  acceptable  levels  of 
predictive  validity  for  worsened  non-surgical  CTS  subjects  is  accepted. 

7.0  SUMMARY  AND  CONCLUSIONS 

The  purpose  of  this  study  was  to  evaluate  the  efficacy  of  commonly  used  clinical 
examination  and  patient  self-report  measures  to  diagnose  and  predict  outcome  in  patients 
with  CR  and  CTS.  The  diagnosis  of  both  conditions  was  based  on  neural  impairment 
findings  from  a  standardized  EMG/NCS  examination.  Outcome  was  based  on  the 
patient’s  surgical  intervention  status  and  global  rating  of  change  scores.  The  study  was 
conducted  with  four  specific  aims:  1 .  To  determine  the  inter-rater  reliability  of  the 
clinical  examination  items  used  for  the  diagnosis  of  CR  and  CTS;  2.  To  to  determine  the 
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diagnostic  accuracy  of  clinical  examination  items  and  patient  self-report  measures  used 
for  the  diagnosis  of  CR  and  CTS,  To  determine  the  predictive  validity  of  these  same 
clinical  examination  and  self-report  items,  electrophysiological  examination  variables, 
and  other  selected  variables  for  surgical  intervention  and  a  measure  of  patient  self- 
reported  outcome;  and  4.  To  determine  an  optimum  and  parsimonious  test  cluster  of  the 
previously  listed  test  item  categories  for  diagnostic  and  predictive  validity  purposes. 

Although  planned  as  a  multi-center  study,  the  majority  of  subjects  (68  of  a  total  of  81) 
were  enrolled  at  a  single  medical  center  (WHMC).  The  standardized  electrophysiologic 
examination  used  to  establish  the  gold  standard  of  neural  impairment  was  performed  by  a 
total  of  nine  different  personnel,  and  results  from  standardized  clinical  examination  used 
to  calculated  diagnostic  accuracy  and  predictive  validity  were  obtained  from  eight 
different  physical  therapist  examiners.  This  limits  the  generalizability  of  the  study  results 
to  other  medical  centers  with  similar  characteristics  but  does  allow  the  results  to  be 
generalized  to  raters  within  those  facilities. 


The  gender  and  age  characteristics  of  subjects  with  CR  and  CTS  in  our  sample  is  similar 
to  that  reported  by  other  authors.274,275  The  prevalence  of  the  two  conditions  (16%  for 
CR  and  38%  for  CTS)  for  our  sample  was  consistent  with  our  expectations.  However, 
three  subjects  were  diagnosed  with  both  conditions.  The  prevalence  of  concomitant  CR 
and  CTS  in  our  sample  (18.7%)  is  higher  than  expected  based  on  prior  reports  (3  - 
5%). 263  The  reason  for  this  difference  in  prevalence  for  subjects  with  both  conditions 
may  be  attributable  to  the  standardized,  rigourous  work-up  received  by  subjects  in  this 
study.  Duration  of  symptoms  of  one  to  6  months  and  one  to  twelve  months  for  CR  and 
CTS,  respectively,  was  an  eligibility  criterion  when  into  the  study  began.  The  upper  limit 
of  this  criteria  was  eliminated  for  both  conditions  due  to  limited  subject  enrollment. 
Based  on  the  median  and  range  of  duration  of  symptoms  for  both  conditions,  it  is  clear 
that  many  individuals  in  our  study  had  symptoms  for  quite  some  time  before  they  were 
referred  for  electrophysiological  testing. 
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The  limited  number  of  CR  subjects  prevented  a  more  detailed  analysis  of  subjects  based 
on  their  EMG/NCS  subclassification.  The  diagnosis  of  CR  was  established  based  on 
spontaneous  activity  in  one  or  more  muscles  of  the  standardized  examination.  Muscles  of 
the  standardized  examination  were  adequate  to  establish  the  diagnosis  in  all  cases  except 
one.  Additional  muscles  sampled  did  provide  information  about  the  possible 
involvement  of  the  C5  nerve  root  and  C8  nerve  root  in  two  cases.  The  C6  and  C7  nerve 
roots  were  determined  to  be  involved  in  all  CR  subjects  in  this  study  and  are  the  two  root 
levels  most  commonly  involved  in  subjects  with  CR.6,276  However,  the  reason  for  the  left 
extremity  predominance  is  unknown.  The  flexor  carpi  radialis  H-reflex  (FCR  H)  was 
prolonged  in  1  subject  and  absent  on  the  involved  side  of  two  subjects  who  were 
diagnosed  with  CR.  Although  others  have  found  the  FCR  H  to  be  useful  for  diagnosis 
and  possibly  predicting  outcome  in  surgical  patients,70  it  was  not  helpful  for  diagnosis  in 
our  study.  The  inability  to  consistently  obtain  this  reflex  may  be  due  to  technical 
limitions,  physiologic  variability,  or  both  factors. 

Approximately  half  of  the  subjects  diagnosed  with  CTS  in  this  study  were  affected 
bilaterally.  All  patients  with  bilateral  CTS  had  predominance  of  symptoms  in  one 
extremity  and  it  was  from  this  extremity  that  clinical  examination  test  results  were 
obtained.  Using  both  affected  hands  from  the  same  extremity  to  increase  sample  size 
might  have  produced  dependant  test  results  and  introduced  a  confounding  factor.16 
Because  false  negative  NCS  findings  may  be  a  concern,118  the  disto-proximal  ratio 
technique  was  included  as  part  of  the  electrophysiological  examination  in  order  to 
maximize  NCS  sensitivity.116,117  This  technique  was  performed  by  all  EMG/NCS 
providers  without  difficulty.  Four  subjects  were  diagnosed  with  CTS  based  solely  on  a 
disto-proximal  ratio  abnormality. 

In  summary,  the  diagnostic  classification  of  subjects  in  this  study  is  credible  and  potential 
biases  were  controlled  for  by  the  subject  recruitment  procedure  and  electrophysiological 
examination  methodology  used.  Based  on  the  electrophysiologic  subclassification  of 
subjects,  the  sample  of  subjects  in  this  study  are  most  representative  of  a  mild  to 
moderate  spectrum  of  disease. 
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7.1  Hypothesis  #1  -  Reliability 

Surprisingly  little  has  been  published  regarding  the  reliability  of  physical  examination 
measures  for  the  upper  extremity  in  general  and  for  CR  and  CTS  in  particular.25 
Furthermore,  no  data  has  been  published  reporting  the  reliability  of  historical  items  of  the 
clinical  examination.  Perhaps  this  is  due  to  the  fact  that  an  accurate  diagnosis  can  be 
made  from  information  obtained  from  the  history  in  the  majority  of  cases,  resulting  in 
little  if  any  concern  about  reliability.28'30  However,  the  potential  for  miscommunication 
or  even  misleading  communication  between  the  clinician  and  patient  is  high,277p41  and  the 
reliability  of  specific  questions  should  be  established  to  assess  their  potential  utility  as  a 
diagnostic  test  measure. 

The  reliability  of  the  following  types  of  clinical  examination  measures  was  assessed  in 
this  study:  single  item  questions,  provocative  tests,  conventional  neurologic  examination 
procedures,  and  scaled  measurements  of  range-of-motion  and  wrist  dimensions.  Out  of 
the  54  different  clinical  examination  variables  assessed  in  this  study,  44  had  reliability 
coefficients  of  at  least  Fair  to  Good  (Kappa=  .40  -  .75)  and  ten  of  these  were  considered 
to  be  Excellent  (Kappa  >.75). 260  Of  these  ten  items,  five  were  single  item  questions. 
Many  other  single  item  questions  had  Kappa  values  approaching  the  Excellent  level  and 
all  questions  achieved  at  least  Fair  to  Good  levels  of  reliability.  For  eighteen  of  the 
variables,  the  precision  of  the  reliability  statistic  was  definitive  and  permits  a  confident 
interpretation  of  the  reliability  coefficient  point  estimate.55  Ten  clinical  examination 
measures  had  a  poor  level  of  reliability,  six  of  which  were  conventional  neurological 
examination  tests. 

Low  prevalence  of  findings  appears  to  be  an  inherent  problem  when  assessing  tests  of 
sensation  and  motor  weakness  in  patients  with  CR.154.  In  addition  to  the  low  prevalence 
of  the  condition  in  this  study,  only  certain  dermatome  levels  and  muscles  may  be  affected 
even  when  CR  is  present,21,64pp  21 1-244  further  contributing  to  a  low  observed  based  rate 
and  misleading  Kappa  value.  The  C6  and  C7  nerve  roots  were  primarily  affected  in  this 
study  and  could  explain  why  the  MMT  and  sensory  tests  of  some  neurological  levels 
were  reliable  and  why  others  were  not.  When  disease  prevalence  is  low,  subjects  are 
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relatively  homogenous  and  demonstrate  relatively  little  variation  between  ratings. 

While  this  is  a  valid  explanation  for  the  poor  reliability  of  the  C8  dermatome  and  C8 
muscle  tests,  it  would  not  explain  the  poor  reliability  for  the  C6  dermatome  and  MMT  of 
the  triceps  brachii  and  flexor  carpi  radialis,  muscles  affected  by  C6  and  C7  nerve  root 
impairment.  One  study  has  reported  Fair  to  Good  reliability  for  MMT  of  the  triceps 
brachii  and  presence  of  a  dermatome  sensory  abnormality,  regardless  of  level.154  We  also 
found  Fair  to  Good  reliability  for  detection  of  a  any  single  dermatome  sensory 
abnormality  between  raters  but  it  is  unknown  why  the  reliability  of  MMT  for  the  triceps 
brachii  and  flexor  carpi  radialis  was  found  to  be  poor  in  this  study. 

Low  observed  base  rates  for  MMT  abnormalities  were  found  in  this  study.  All  results 
from  the  rater  pair  used  in  the  reliability  analysis  were  one  response  level  apart  (between 
“F+  -  G”  and  “NT”),  with  the  exception  of  three  subjects  who  had  P-  to  F  ratings. 
Therefore,  MMT  results  were  dichotomized  into  normal  or  abnormal  findings  for  the 
involved  limb  and  reliability  was  computed  based  on  a  dichotomized  test  result.  The 
difference  in  reliability  of  MMT  for  the  abductor  pollicus  brevis  between  the  entire 
sample  of  subjects  and  the  CTS  subgroup  is  an  example  of  how  variation  can  impact  a 
reliability  statistic;  variation  must  be  present  in  order  to  be  explained  or  accounted  for. 
When  all  subjects  are  used  to  compute  the  reliability  statistic  for  the  abductor  pollicus 
brevis  muscle,  reliability  is  poor  (Kappa=  .39).  When  only  CTS  subjects  are  used, 
reliability  is  Fair  to  Good  (Kappa=  .65).  When  compared  to  the  entire  sample  of  subjects, 
the  CTS  subgroup  had  a  larger  proportion  of  subjects  with  weakness  of  the  abductor 
pollicus  brevis.  The  reliability  of  MMT  in  this  study  cannot  be  generalized  to  the  more 
commonly  used  five-level  MMT  scheme.  The  five-level  ordinal  scale  has  a  larger 
number  of  response  levels  and  would  likely  demonstrate  a  lower  level  of  reliability. 

Two  studies  have  reported  the  reliability  of  MMT  scoring  systems  of  five  or  more  levels 
but  the  results  are  reported  as  percent  agreement  or  with  a  Pearson  correlation  coefficient 
and  cannot  be  compared  with  our  results.155,156 

Surprisingly,  both  the  ULTT  A  and  B  were  found  to  have  an  Excellent  level  of  reliability. 
The  only  other  study  to  assess  the  reliability  of  this  test  found  it  to  be  poor  (Kappa= 
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.35). 154  The  ULTT  was  not  operationally  defined  in  the  study  by  Viikari-Juntura  and  is 

the  most  likely  explanation  for  the  difference  in  their  results  and  ours.154  The  Excellent 

reliability  of  the  ULTT’s  found  in  this  study  is  only  generalizable  when  the  limb  tension 

tests  are  performed  according  to  our  operational  definition.  This  is  an  important 

distinction  because  at  least  four  variations  of  ULTT’s  have  been  described.177  Our  results 

for  Spurling’s,  Distraction,  and  shoulder  abduction  tests  are  similar  to  those  of  Viikari- 

Juntura,  although  we  found  an  Excellent  level  of  reliability  for  the  distraction  test  versus 

their  finding  of  Fair  to  Good.154  The  Kappa  statistic  for  Phalen’s  test  (Kappa=  .79)  and 

Tinel’s  sign  (Kappa  A=  .47  and  B=35)  appear  to  be  different  (higher  and  lower, 

*  212 

respectively)  from  the  only  other  study  that  has  assessed  the  reliability  of.these  items. 

Since  the  procedures  were  operationally  defined,  the  reason  for  these  differences  is 
unknown.  The  reliability  of  the  CCT,  which  was  found  to  be  Excellent,  has  not  been 
previously  reported. 

Reliability  for  cervical  range-of-motion  measurements  was  lower  than  expected.  Based 
on  our  standardized  procedure  for  taking  measurements  with  a  bubble  goniometer  and  a 
previous  report,170  we  hypothesized  that  the  reliability  of  cervical  range-of-motion 
measurements  would  be  Excellent.  There  are  several  possible  reasons  for  our  findings. 
We  used  ICC  formula  (2,1)  to  compute  our  reliability  statistic.  While  Hole  reported  ICC 
values  ranging  from  .82  to  .86  (except  right  rotation  which  was  .76),  they  did  not  report 
the  ICC  formula  they  used.  Had  we  used  formula  ICC  (2,k),  our  results  would  have 
ranged  from  .77  to  .91,  very  similar  to  theirs.  Cervical  rotation  measurements  were  taken 
with  a  Universal  goniometer.  The  ICC’s  for  right  and  left  rotation  equaled  and  exceed, 
respectively,  those  reported  by  Youdas  et.al.,165  who  also  did  not  report  which  formula 
they  used  to  determine  their  ICC  value.  Based  on  visual  assessment  of  stem-and-leaf 
plots,  the  distribution  of  measurements  for  right  rotation  appears  to  be  smaller  than  for 
left  rotation.  While  this  may  explain  the  disparity  between  the  ICC’s  of  the  two 
measures,255  the  SEM’s  for  the  two  measures  do  not  appear  to  be  different  and  indicate  a 
similar  level  of  precision.  Precision  for  all  cervical  range-of-motion  measurements,  as 
indicated  by  their  SEM,  is  less  than  would  be  desired  for  clinical  decision  making.  Using 
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a  95 Cl  based  on  our  results,  the  true  value  for  a  cervical  flexion  measurement  of  65 
degrees  could  be  anywhere  from  56  degrees  to  74  degrees,  a  range  of  nearly  20  degrees. 

The  ICC  of  both  wrist  measurements  is  less  than  excellent.  A  restriction  in  range  and 
distribution  scores  appear  to  be  the  cause  for  a  lower  than  expected  level  of  reliability  for 
these  measurements.  However,  the  SEM  for  both  wrist  measures  is  small  (2. 1mm),  and 
indicates  a  high  level  of  precision  or  small  measurement  error  component  over 

254 

measurement  occasions. 

The  high  levels  of  reliability  found  for  most  clinical  examination  items  in  this  study  may 
be  due  to  several  factors.  The  primary  factors  include:  having  operationally  defined  test 
procedures;  raters  who  were  briefed  on  test  performance  prior  to  the  study;  test 
procedures  practiced  by  raters  prior  to  study  implementation;  and  a  standardized  scoring 
sheet  to  record  results.278  Secondary  factors  include  the  dedicated  examination  space  set 
aside  for  physical  therapist  raters,  allocated  time  for  examination,  and  that  the  raters 
knew  they  were  involved  in  a  study. 277pp  43‘47  Although  patient  recall  may  have 
contributed  to  the  high  reliability  of  single  item  test  questions,  this  would  have  been 
minimized  by  the  two  day  interval  between  the  first  rater  and  second  rater  questioning. 


7.2  Hypothesis  #2  -  Test  Diagnostic  Accuracy 

The  result  of  any  test  can  be  interpreted  as  an  argument  to  strengthen  or  weaken  a  disease 
hypothesis  based  on  the  information  available  from  the  patient.54  Unlike  reliability,  there 
are  no  descriptive  values  for  what  constitutes  an  “Excellent”  ‘Tair”  or  “Poor”  level  of 
diagnostic  accuracy.  The  true  value  of  a  diagnostic  test  can  be  judged  by  its 
discriminative  power,  or  contribution  to  correct  decision  making  about  the  presence  or 
absence  of  a  disorder  based  on  the  test  result.23,54 

The  criteria  set  forth  in  this  study  for  an  acceptable  level  of  test  diagnostic  accuracy  was 
based  on  the  ability  of  a  test  to  change  post-test  probabilities  of  the  disorder  at  least  1 5%. 
Based  on  these  criteria,  variables  for  which  Sn  or  Sp  were  >70  were  included.  This  was 
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done  to  prevent  excluding  potentially  useful  useful  variables.  Positive  and  negative  LR’s 
(LR+  and  LR-)  were  the  metrics  used  to  estimate  post-test  probability  changes  because 
they  summarize  the  information  of  both  Sn  and  Sp  and  thereby  represent  the 
discriminative  power  of  a  test.  Due  to  the  limited  sample  size  and  the  impact  of  diseased 
to  non-diseased  ratio  on  LR  confidence  intervals,23  the  95CI  of  LR  values  for  test  items  in 
this  study  were  wide  and  power  was  not  satisfactory  to  detect  definitive  findings. 

7,2.1  Cervical  Radiculopathy 

There  were  39  tests  for  CR  and  34  met  the  criteria  for  acceptability.  Of  the  34  tests,  all 
but  two  of  these  were  acceptable  based  on  Sp  values  alone.  A  much  smaller  number  (12) 
had  Likelihood  ratios  that  would  result  in  meaningful  post-test  probability  changes.  The 
12  tests  with  acceptable  Likelihood  ratios  were  representative  of  the  following  clinical 
examination  categories:  history=  3  (numbers  2(ii),  7,  and  9),  conventional  neurologic 
examination=  3  (MMT  deltoid,  biceps  brachii,  and  biceps  brachii  MSR),  provocative 
tests=  4  (Spurling’s  A,  Valsalva,  Distraction,  and  ULTT  A),  ROM  measurements^  1  (left 
rotation),  and  self-report  instruments  (VASworse).  Of  these  12  tests,  three  had  both  LR+ 
and  LR-  values  that  were  acceptable  (two  questions  and  left  cervical  rotation). 

The  percentage  of  patients  with  a  confirmed  diagnosis  of  CR  who  are  offered  surgical 
intervention  can  be  relatively  high  (35%),  even  on  the  initial  visit.279.  Therefore,  tests 
with  high  Sp  and  LR+  values  are  preferred.  The  majority  of  LR’s  in  our  study  that  were 
acceptable  were  LR+,  indicating  that  their  respective  tests  serve  the  intended  purpose  for 
the  examination  of  CR  patients  and  are  useful  for  diagnosis.  Many  clinicians  view 
abnormal  findings  of  the  conventional  neurological  examination  to  be  the  sin  quo  non  of 
CR,  and  the  finding  that  all  neurological  test  items  had  acceptable  levels  of  specificity 
appears  to  justify  this  position.  However,  only  three  of  the  14  neurological  test  items  had 
acceptable  LR+  values  that  would  produce  meaningful  changes  in  post-test  probability. 
The  single  question  items,  provocative  tests,  and  the  VASworse  scale  had  higher  LR+ 
values  than  two  of  three  conventional  neurological  examination  items.  Question  2- 
“Where  most  bothersome.. ”(ii  (shoulder/shoulder  blade)),  had  an  acceptable  LRi  and 
appears  to  support  the  long  held  view  that  predominate  scapular  pain  is  strongly 
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associated  with  radiculopathy.280  Question  7-“Symptoms  keep  from  falling  asleep.”  had 
the  single  best  LR+  value  (6.5),  which  was  also  the  only  definitive  Likelihood  ratio 
finding,  despite  the  limitation  in  power  described  earlier.  Based  on  the  prevalence  of  CR 
in  our  study  (16%),  a  positive  response  to  question  7  would  result  in  a  post -test 
probability  of  55%.  This  means  that  probability  of  the  patient  having  CR  has  now 
increased  39%  based  on  the  response  to  a  single  question. 

The  ULTT  A  was  found  to  have  perfect  Sn  in  this  study  and  resulted  in  a  LR-  of .  15. 

Even  with  the  low  prevalence  of  CR  in  our  study,  a  negative  result  on  this  test  would 
result  in  a  post-test  probability  of  3%  and  essentially  rule-out  CR.  The  ULTTA  is  a 
potentially  useful  screening  test  to  determine  who  will  undergo  the  time  and  expense  of 
an  EMG/NCS  test.  Cervical  rotation  to  the  involved  side  (only  the  left  side  in  this  study) 
was  also  highly  Sn  with  an  LR-  comparable  to  the  ULTT  A.  Because  the  limited  number 
of  subjects  with  right  sided  involvement,  it  is  unknown  if  rotation  to  the  involved  is  also 
Sn  for  subjects  with  right-sided  CR.  Interestingly,  the  reliability  of  left  cervical  rotation 
was  only  Good.  This  provides  an  example  of  why  a  test  with  less  than  excellent 
reliability  must  be  examined  closely  before  being  dismissed  as  useless.254 

Question  7 -“Do  your  symptoms  keep  you  from  falling  asleep  at  night?”  was  the  only  CR 
clinical  examination  test  item  with  a  definitive  LR+  finding.  One  can  be  confident,  based 
on  the  results  of  this  study,  that  the  true  LR+  value  for  a  “yes”  response  to  this  question 
lies  between  2.3  to  1 8.0.  The  LR-  values  for  the  ULTT  A  and  for  left  cervical  rotation 
were  not  definitive  and  caution  must  exercised  when  interpreting  the  results  of  these  two 
tests  due  to  the  wide  95CI. 

7,2,2  Carpal  Tunnel  Syndrome 

The  treatment  of  CTS  often  follows  a  typical  and  progressive  course  of  medication  and 
activity  modification,  splinting,  and  surgery  based  on  patient  response  to  treatement.15 
Because  surgical  intervention  for  CTS  is  the  most  common  surgical  procedure  performed 
in  the  hand,281  may  fail  to  relieve  symptoms  in  a  number  of  cases,14  and  is  often  based  on 
clinical  examination  findings,282  test  items  with  high  Sp  and  LR+  values  are  important  to 
prevent  unnecessary  intervention.  However,  useful  screening  tests  with  high  Sn  and  LR- 
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values  are  also  desired  since  CTS  is  considered  a  work-related  musculoskeletal  disorder 

,o, 

with  high  workers  compensation  costs. 

The  diagnostic  characteristics  of  clinical  examination  tests  for  CTS  will  first  be  discussed 
separately  for  the  Mild/Moderate  and  Pronounced/Severe  CTS  subclassification  groups, 
followed  by  a  discussion  of  these  same  test  items  for  the  CTS  group  as  whole.  The  same 
control  group  was  used  for  both  groups,  therefore  specificity  values  remained  unchanged 
in  both  subclassification  analyses  and  are  not  discussed. 

7 , 2 , 2 . 1 .  Subclassification  Group  Results 

For  the  Mild/Moderate  group,  1 1  of  the  32  tests  for  CTS  met  the  acceptance  criteria.  Of 
the  1 1  tests,  five  had  Likelihood  ratios  that  would  result  in  meaningful  post-test 
probability  changes.  The  five  tests  were  representative  of  the  following  clinical 
examination  categories:  history=  2  (numbers  2-“Where  most  bothersome..”  (iii  (arm  AE) 
and  10-‘Tiand  shaking  improves..”),  conventional  neurologic  examination=  1  (MMT 
abductor  pollicus  brevis),  and  self-report  instruments=  2  (SSS  and  FSS).  For  the 
Pronounced/Severe  group,  14  tests  met  the  criteria  for  acceptability.  Of  these  14  tests, 
eight  had  Likelihood  ratios  that  would  result  in  meaningful  post-test  probability  changes. 
The  eight  tests  were  representative  of  the  following  clinical  examination  categories: 
history^  4  (numbers  2-“Where  most  bothersome.. ”(v  (hand/fingers)),  3 -“Symptom 
behavior..”,  5-‘Tumbling/dropping..”,  and  10-“Hand  shaking  improves..”),  conventional 
neurologic  examination^  3  (MMT  abductor  pollicus  brevis,  median  sensory  fields  1  and 
3),  and  self-report  instruments=  (FSS).  Of  these  eight  tests,  three  had  both  LR+  and  LR- 
values  that  were  acceptable  (questions  5,  10,  and  median  sensory  field  1). 

A  comparison  of  the  groups  based  on  test  items  with  acceptable  LR  values  show  that  the 
diagnostic  characteristics  of  the  following  test  items  are  similar  for  both  groups:  FFS, 
MMT  abductor  pollicus  brevis,  and  question  10.  Two  of  these  items  address  impairment 
and  the  other  relief  of  symptoms.  Question  10,  which  asks  “Do  your  symptoms  improve 
with  moving,  “shaking”,  or  positioning  your  hands?”,  has  been  reported  in  the  literature 
as  a  provocative  test  and  called  the  “Flick  sign”.283  Although  the  Flick  sign  was  not 
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tested,  we  included  this  question  in  our  study  because  we  thought  it  to  be  reflective  of  the 
Flick  sign. 


Test  items  that  demonstrated  potentially  different  diagnostic  characteristics  between  the 
groups  and  had  acceptable  LR+’s  include  the  following:  For  the  Mild/Moderate  group 
question  2 -“Where  most  bothersome..”  (iii  (arm  AE))  and  the  SSS,  For  the 
Pronounced/Severe  group  questions  2-“Where  most  bothersome  (v  (hand/fingers)),  3- 
“Symptom  behavior  (i  (constant)),  and  5-“Fumbling/dropping”,  and  median  sensory 
fields  1  and  3 .  Questions  2  and  3  address  location  and  behavior  of  symptoms, 
respectively.  Question  5  and  median  sensory  fields  address  motor  and  sensory 
impairment,  respectively.  Question  8,  “Do  your  symptoms  wake  you  at  night”  is  of 
interest.  This  question  is  thought  to  be  highly  diagnostic  of  CTS  but  does  not  appear  to 
be  useful  in  mild  cases.  The  SSS  is  used  to  measure  symptoms  severity.  From  these 
results,  it  appears  that  severity  of  symptoms  (SSS)  and  primary  complaint  of  upper  arm 
symptoms  are  useful  for  the  diagnosis  of  Mild/Moderate  CTS,  but  not  for 
Pronounced/Severe  CTS.  Likewise,  it  appears  that  constant  symptoms  which  most 
bothersome  in  the  hand  and/or  fingers  along  with  functional  limitations  of  grasp  and 
sensory  impairment  are  useful  for  the  diagnosis  of  Pronounced/Severe  CTS,  but  not  for 
the  Mild/Moderate  CTS.  A  larger  sample  size  is  needed  to  determine  if  these  apparent 
differences  are  significant.  These  findings  are  consistent  with  the  existence  of  spectrum 
bias.50  Although  both  groups  have  several  common  symptoms  and  findings,  subjects  in 
the  less  severe  group  are  better  distinguished  by  diffuse  sensory  complaints  while 
subjects  in  the  more  severe  group  are  better  distinguished  by  localized  neurological 
impairment  and  functional  limitations  of  the  hand. 

For  the  Pronounced/Severe  group,  three  test  items  had  both  LR+  and  LR-  values  that 
were  acceptable  (Questions  5-“Fumbling/dropping.”  and  10-“Hand  shaking  improves..” 
and  median  sensory  field  1);  no  test  items  were  identified  for  the  Mild/Moderate  group. 
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7, 2. 2. 2  Collective  CTS  Group  Results 

As  expected,  the  diagnostic  utility  of  the  clinical  examination  test  items  was  diminished 
when  the  two  groups  of  subclassified  CTS  patients  were  considered  together.  A  total  of 
17  of  the  32  tests  for  CTS  met  the  acceptance  criteria.  Of  the  17  tests,  six  had  LR’s  that 
would  result  in  meaningful  post-test  probability  changes.  The  six  tests  were 
representative  of  the  following  clinical  examination  categories:  history=  3  (numbers  3- 
“Symptom  behavior..”  (i  (constant)),  5-“Fumbling/dropping..”,  and  10-“Shaking  hand 
improves..”),  conventional  neurologic  examination^  1  (MMT  abductor  pollicus  brevis), 
and  self-report  instruments=  2  (SSS  and  FSS).  In  addition  to  the  lower  number  of  LR  s 
found  acceptable  compared  to  CR,  the  discriminative  power  of  these  Likelihood  ratios 
was  reduced  when  the  two  groups  were  considered  together.  The  exception  to  this  was 
question  number  10.  The  acceptable  LR+  and  LR-  values  for  question  10  remained 
nearly  the  same  in  the  subgroup  and  total  group  analyses. 

The  single  best  LR+  value  was  the  FSS  with  a  cut-off  value  of  2.5  (LR+=  3.1);  the 
remainder  of  test  items  had  LR+  values  of  2.5  or  less.  Based  on  the  prevalence  of  CTS  in 
our  study  (38%),  an  FSS  score  of  >2.5  would  result  in  a  post-test  probability  of  66%. 
This  means  that  probability  of  the  patient  having  CTS  has  now  increased  28%  based  on 
the  score  of  this  self-report  instrument.  Question  number  10-“Hand  shaking  improves.  ” 
is  of  particular  interest  because  of  its  stability  across  group  analyses  for  both  LR+  and 
LR-  values.  Calculating  changes  in  probability  based  on  clinical  examination  test  results, 
if  a  subject  responds  to  question  10  with  a  “no”,  then  the  probability  of  CTS  being 
present  has  diminished  to  20%.  Unlike  ULTT  A  for  CR,  the  post-test  probability 
generated  by  a  negative  response  to  question  10-“Hand  shaking  improves..”  is  not 
satisfactory  to  warrant  its  consideration  as  a  screening  test  to  rule-out  CTS  and  eliminate 
the  need  for  an  EMG/NCS  examination. 

In  this  discussion  of  useful  clinical  examination  tests,  Tinel’s  sign,  the  CCT,  and  Phalen’s 
test  are  conspicuous  by  their  absence.  Neither  of  these  tests  had  accepatable  Likelihood 
ratios  in  any  of  the  analyses.  However,  the  Sn  of  the  CCT  was  .70  in  the  Mild/Moderate 
group  and  .80  in  the  Pronounced/Severe  group.  Due  to  lack  of  power  in  the  subgroup 
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analyses,  it  is  unknown  if  this  increased  Sn  value  is  significant.  However,  based  on  the 
95CI  in  the  total  group  analysis,  Sp  for  both  the  Phalen’s  test  and  the  CCT  was 
considered  definitively  unacceptable.  Sensitivity  of  Tinel’s  sign  and  Sp  for  the  hand 
diagram  and  wrist  ratio  were  also  definitively  unacceptable.  At  least  19  studies  of 
Phalen’s  and  Tinel’s  sign  have  been  conducted.  The  earlier  review  of  these  studies  in 
Table  4  and  Figure  9  listed  the  wide  range  of  Sn  and  Sp  values  reported  for  these  tests. 
The  most  methodologically  sound  of  these  studies  was  by  De  Krom  et.  al.  who  also  found 
lower  Sn  and  Sp  values  for  Phalen’s  test  and  Tinel’s  sign.'60  The  CCT  has  not  been 
studied  as  extensively  as  Phalen’s  test  and  Tinel’s  sign  but  has  a  similar  wide  range  of 
reported  diagnostic  accuracy  values  derived  from  methodologically  deficient  studies. 


Although  the  diagnostic  accuracy  values  derived  from  the  initial  retrospective  report  of 
the  hand  diagram  were  promising,223  subsequent  prospective  investigations  have  reported 
much  lower  values  of  Sn  and  Sp  that  are  similar  to  the  values  we  found  for  Sn16'152. 
However,  our  Sp  values  appear  to  be  much  lower  (.43  vs.  .71)  and  may  be  due  to  our 
control  group  which  consisted  of  CR  patients  and  patients  with  symptoms  similar  to  CR 
and  CTS.  The  Sn  and  Sp  values  for  question  5  are  similar  to  the  results  of  Katz  et.  al.16. 
Our  Sp  and  Sn  values  for  the  wrist  ratio  are  consistent  with  the  findings  reported  by 
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Kuhlman  and  Hennessey. 

7,2,3  Summary  of  Diagnostic  Accuracy  Findings 

Any  interpretation  of  the  utility  of  individual  test  items  of  the  clinical  examination 
reported  in  this  study  for  the  diagnosis  of  CR  or  CTS  must  be  done  in  the  context  of  our 
neural  impairment  gold  standard.  Although  no  gold  standard  is  perfect,40  we  considered 
neural  impairment  based  on  NCS/EMG  findings  to  be  the  optimum  reference  criterion  for 
the  purposes  of  our  study.  The  arguments  regarding  false  positive  and  negative  findings 
related  to  NCS  procedures  have  been  discussed  in  section  2.3.2  and  will  not  be  repeated 
here. 

Another  facet  related  to  gold  standard  selection  is  our  liberal  criteria  for  a  positive 
EMG/NCS  test  for  both  CR  and  CTS.  It  is  possible  that  some  subjects  were  classified  as 
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positive  for  disease  based  on  artifact  and  anomalous  or  irrelevant  findings.  Including  a 
third  group  of  asymptomatic  subjects  would  have  allowed  us  to  assess  the  potential 
impact  our  liberal  diagnositic  criteria  had  on  subject  classification.  However,  performing 
the  standardized  EMG/NCS  examination  on  entirely  asymptomatic  subjects  would  be 
difficult  to  justify  both  ethically  and  monetarily,  especially  in  light  of  findings  from 
recent  studies  using  this  design.213’284  Both  Szabo  et.al.213  as  well  as  Gerr  and  Letz284 
found  that  using  asymptomatic  subjects  as  a  control  group  produced  falsely  optimistic 
estimates  of  diagnotic  accuracy  for  the  diagnostic  tests  being  considered.  Although 
liberal,  our  criteria  were  based  on  peer-reviewed  published  literature.75'113  However, 
using  a  more  conservative  gold  standard  criterion  has  merit  since  surgical  decisions  are 
often  based  on  EMG/NCS  test  results.64^111'160’ 211'244 

The  utility  of  the  clinical  examination  was  better  for  the  diagnosis  of  CR  than  for  CTS 
based  on  the  number  of  acceptable  Likelihood  ratios,  their  discriminative  power,  and  the 
change  in  post-test  probability  for  a  single  test  item  of  the  clinical  examination.  This  may 
be  the  result  of  spectrum  bias  as  observed  in  the  subclassification  analyses.  Identifying 
characteristics  of  subjects  most  probable  to  be  classified  as  Pronounced/Severe50  would 
increase  the  diagnostic  utility  found  for  several  test  items  in  this  study.  The  impact  of 
spectrum  bias  on  diagnostic  tests  for  CR  is  unknown  and  was  not  assessed. 

The  utility  of  single  item  test  questions  for  both  conditions  is  remarkable  when  compared 
to  other  categories  of  clinical  examination  items  and  is  consistent  with  previous  reports 

28-30 

regarding  the  diagnostic  power  of  a  patient’s  condition  history  and  symptoms. 

However,  no  single  test  item  resulted  in  a  post-test  probability  greater  than  66%  for  either 
condition.  If  maximum  specificity  is  desired  for  these  two  conditions,  more  definitive 
test  procedures  must  be  performed.  Similarly,  the  test  item  with  the  best  single  LR-  for 
CTS  did  not  result  in  adequate  post-test  probabilities  to  be  considered  useful  as  a 
screening  tool.  The  ULTT  A  appears  to  be  a  potentially  excellent  and  useful  screening 
test  for  CR  but  further  study  with  a  larger  sample  is  required  to  establish  a  definitive  LR- 
value. 
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7.3  Hypothesis  #3  -  Test  Predictive  Validity 

The  use  of  patient  outcome  as  a  gold  standard  attempts  to  define  the  properties  of  a 
diagnostic  test  at  the  highest  level  of  efficacy.40  The  use  of  an  outcome  reference 
criterion,  or  gold  standard,  in  conjunction  with  diagnostic  test  methodology  and  indices 
has  been  used  to  measure  the  responsiveness  of  HS AM’s285  and  more  recently  to  predict 
pain  relief  from  splinting  intervention  in  subjects  with  CTS.286  Surgical  intervention 
status  was  used  as  one  outcome  criterion  measure  and  change  based  on  a  GRCS  as  the 
other  outcome  criterion  measure.  Clinical  examination  items  for  CR  were  not  assessed 
using  these  outcome  criteria  but  both  outcome  gold  standards  were  assessed  in  patients 
with  CTS.  The  results  of  all  three  predictive  validity  analyses  for  CTS  patients  must  be 
considered  preliminary  due  to  the  small  sample  size,  low  prevalence  of  surgical 
intervention  and  patients  classified  as  changed,  and  incomplete  follow-up  of  81%. 

7.3.1  Surgical  Intervention  Gold  Standard 

Surgical  intervention  is  not  based  solely  on  the  decision  of  the  surgeon  and  the  patient; 
both  individuals  have  significant  input  into  the  decision  to  operate.  Even  though  the 
decision  to  intervene  surgically  is  probably  multi-factorial  in  nature,  it  is  possible  that 
several  overriding  factors  influencing  this  decision  can  be  identified.  Because  all  patients 
with  positive  EMG/NCS  test  results  could  be  considered  potential  surgical  candidates, 
test  with  a  high  specificity  are  desired.  Positively  identifying  which  patients  will  receive 
surgery  based  on  the  results  of  clinical  examination  findings  and  other  predictor  variables 
would  allow  a  closer  examination  and  further  investigation  of  those  variables. 

The  use  of  EMG/NCS  test  results  as  test  variables  could  be  considered  a  form  of 
inclusion  bias.  However,  not  all  surgeons  base  surgical  intervention  on  EMG/NCS  test 
results282  and  it  is  unknown  which  EMG/NCS  test  items,  if  any,  are  strong  predictors  of 
surgical  intervention.  Therefore,  we  included  several  median  nerve  EMG/NCS 
parameters  as  predictor  variables  along  with  age,  duration  of  symptoms,  and  FABQ 


scores. 


176 


The  predictive  validity  of  32  clinical  examination  test  items  was  evaluated  in  addition  to 
seven  additional  predictor  variables.  Twenty  of  the  test  items  and  all  of  the  additional 
predictor  variables  met  the  criteria  for  acceptability  for  a  total  of  27  items.  There  were  17 
items  that  had  acceptable  Likelhood  ratios,  1 1  were  clinical  examination  items  and  6 
were  additional  predictor  variables.  Acceptable  Likelihood  ratios  were  representative  of 
the  following  clinical  examination  categories:  history=  5  (numbers  l-“Most  bothersome 
symptoms  (iii  (loss  of  feeling)),  5-“Fumbling/dropping..”,  10-“Hand  shaking  improves..”, 
and  1  l-”Worse  with  hand  use..”),  conventional  neurologic  examination=  2  (median 
sensory  field  1  and  MMT  abductor  pollicus  brevis),  provocative  tests=  1  (Tinel’s  B), 
scaled  measurement=  2  (wrist  ratio  =  >.70  and  >.73),  self-report  instruments=l  (FSS 
>2.3).  The  only  additional  predictor  variables  that  did  not  result  in  an  acceptable 
Likelihood  ratio  were  the  FABQ  A  and  B  scales.  Of  these  17  variables,  three  had  both 
LR+  and  LR-  values  that  were  acceptable  (MMT  abductor  pollicus  brevis,  wrist  ratio 
>.73,  and  spontaneous  EMG  activity  in  the  abductor  pollicus  brevis). 

The  abductor  pollicus  brevis  motor  amplitude  (<4800uv)  and  presence  of  spontaneous 
activity  in  the  EMG  produced  the  two  largest  LR+  (17.5  and  1 1.4,  respectively).  The 
next  largest  LR+  (10.5)  was  question  l-“Most  bothersome  symptoms. .”  (iii),  indicating 
that  the  patient’s  most  bothersome  symptom  was  loss  of  feeling.  The  remaining  LR+ 
values  were  3.5  or  less.  The  two  variables  with  the  largest  LR+  are  related,  so  only  one 
should  be  used  to  estimate  the  post-test  change  in  probability.  Based  on  the  pre-test 
probability  of  surgery  for  the  25  subjects  for  who  returns  were  available  (20%),  a  subject 
with  a  motor  amplitude  of  <4800uv  now  has  an  81%  chance  of  having  surgery;  a  sizable 
60%  increase  in  probability. 

A  number  of  test  items  items  had  small  LR-  values  indicating  that  unless  these  variables 
are  present,  surgical  intervention  is  not  likely.  The  smallest  of  these  were  all  questions 
(numbers  5-‘Tumbling/dropping..”,  10-“Hand  shaking  improves..”,  and  ll-“Worse  with 
hand  use..”)  with  LR-  values  ranging  from  .04  to  .08.  Duration  of  symptoms  less  than  78 
days  and  age  <  48  years  also  had  a  low  LR-  value  (.19  and  .  14,  respectively).  These  data 
indicate  that  surgical  intervention  will  not  be  performed  if  the  subject  does  not  have  some 
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component  of  symptoms  that  are  most  bothersome  in  the  hand,  complaints  of  functional 
limitations,  is  able  to  affect  symptoms  with  hand  movement,  and  if  the  symptoms  have 
not  been  present  longer  than  78  days.  It  also  indicates  that  surgery  is  much  less  likely  to 
be  performed  on  older  subjects  during  this  short-term  follow-up  period.  Using  the  single 
best  test  item  LR-  to  calculate  post-test  odds,  a  negative  response  to  question  1 0 
diminishes  the  probability  of  surgery  diminishes  from  20%  to  5%. 

These  data  suggest  the  EMG/NCS  parameters  related  to  the  loss  of  motor  units  are  the 
strongest  predictors  for  surgery,  which  is  expected  because  patients  in  our  study  were 
referred  from  surgeons  who  utilize  EMG/NCS  testing  for  the  management  of  patients 
with  CTS.  However,  the  number  of  single  question  items  with  highly  predictive  Sn  and 
LR-  values  was  not  expected.  Surgical  intervention  for  the  treatment  of  CTS  appears  to 
be  based  on  several  facets,  which  include  impairment,  symptoms,  and  functional 
limitations.  These  results  may  be  useful  as  a  starting  point  for  further  quantifying 
specific  items  that  represent  these  facets  and  how  they  may  be  used  to  predict  which  CTS 
patient  will  be  treated  surgically.  This  type  of  prognostic  information  would  be  useful  to 
clinicians  treating  CTS  patients  conservatively  for  knowing  when  to  terminate  treatment, 
and  for  surgeons  in  order  to  analyze  their  treatment  decisions. 

7.3.2  Change  Gold  Standard 

Because  only  a  few  of  the  20  non-surgical  CTS  groups  experienced  change  in  either 
direction  (improved  or  worsened),  we  lowered  to  threshold  of  change  from  >  +/-  four  or 
more  points  to  >  +/-  three  or  more  points  on  the  GRCS.  We  justified  our  change  in 
criteria  based  on  our  interest  in  the  patients  self-reported  perception  of  change  and  that 
our  original  criteria  may  have  been  too  stringent  to  detect  meaningful  change  in  a  six- 
week  time  period.  The  results  of  the  predictive  validity  analyses  and  the  descriptive 
results  from  the  HSAM’s  of  the  patients  who  considered  themselves  worsened  coincide 
with  what  would  be  expected  in  a  worsened  patient.  Unfortunately,  the  predictive 
validity  results  for  the  improved  group  were  often  paradoxical  and  somewhat  difficult  to 
interpret.  The  predictive  validity  results  for  patients  who  considered  themselves 
worsened  will  be  discussed  first,  followed  by  the  results  of  the  “improved”  group. 
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7.3.2. 1  Worsened  Patients 

For  the  identification  of  subjects  who  will  worsen,  tests  with  good  screening  properties 
are  desirable  to  maximize  identification  of  subjects  with  a  potentially  poor  or  adverse 
outcome.287  For  the  subjects  considered  worsened,  the  predictive  validity  of  30  clinical 
examination  tests  items  was  evaluated  in  addition  to  one  additional  predictor  variable. 
Age,  duration  of  symptoms,  EMG/NCS  tests  items  (except  spontaneous  activity  in  the 
abductor  pollicus  brevis  muscle),  and  the  FABQ  questionnaire  were  eliminated  from 
further  consideration  as  predictive  variables  by  the  ROC  curve  analysis.  Seventeen  of  the 
test  items  met  the  criteria  for  acceptability  and  12  of  these  had  acceptable  Likelhood 
ratios.  Acceptable  Likelihood  ratios  were  representative  of  the  following  clinical 
examination  categories:  history=  5  (numbers  2 -“Where  most  bothersome..”  (iii  (arm  AE) 
and  iv  (arm  BE)),  5-‘Tumbling/dropping..”,  8-“Night  symptoms  wake..”,  and  10-), 
provocative  tests=  2  (Phalen’s  test  and  ULTT  B),  scaled  measurement^  1  (wrist  ratio  = 
>.70),  and  self-report  instruments33  4  (FSS  >2.0,  SSS  >2.0,  VAS  >4.9,  and  hand  diagram 
(1-3=  positive  test)).  There  were  no  conventional  neurologic  examination  test  items  that 
were  useful  predictors  of  worsening. 


Several  tests  had  predictive  validity  properties  useful  for  screening.  Question  5- 
“Fumbling/dropping.”  and  the  hand  diagram  (1-3  positive)  both  had  an  LR-  value  of  .16. 
Question  5  also  had  a  similar  LR-  value  of .  16  for  predicting  surgical  intervention.  If  the 
hand  diagram  or  question  5  is  used  for  screening  which  patients  will  worsen,  the  pretest 
probability  of  a  patient  in  this  sample  changes  from  35%  to  9%  when  a  negative  test 
result  is  obtained  (i.e.,  pain  drawing  is  not  consistent  with  median  nerve  impairment). 
Other  items  with  good  screening  properties  represented  the  constructs  of  a  irritability 
(VASnow,  SSS,  Phalen’s,  ULTT  B),  functional  limitation  (question  5  and  FSS)  and 
possible  anatomical  predisposition  to  the  condition  (wrist  ratio).  Overall,  the  predictive 
items  most  powerful  for  subjects  who  worsen  indicate  intense  forearm  pain  and 
complaints  of  functional  limitations  consistent  with  median  nerve  impairment. 

Test  items  diagnostic  of  worsening  were  pain  in  the  forearm  or  upper  arm  and  a  high 
VASnow  rating.  A  positive  response  to  question  number  2-“Where  most  bothersome.  ” 
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(iv  (arm  BE)),  would  increase  the  probability  of  a  patient  in  this  study  worsening  from 
35%  to  87%.  A  test  that  yields  this  information  could  be  useful  for  clinicians  for  the 
purposes  of  prognosis  and  decisions  regarding  continued  or  more  aggressive  conservative 
intervention  and  deciding  when  conservative  treatment  options  have  been  exhausted. 

7  3  2.2  “Improved”  patients 

Descriptive  results  from  the  HSAM’s  of  the  patients  who  considered  themselves 
improved  do  not  coincide  with  what  would  be  expected  in  an  improved  patient.  There 
are  a  couple  of  possible  reasons  for  this.  The  first  reason  is  that  these  patients  are  not 
actually  improved.  An  improvement  of  >  +/-  3  points  on  the  GRCS  may  be  only  a 
reflection  of  a  patients  desire  to  please  the  investigator2T7p41  or  measurement  error.  A 
comparison  of  the  FSS  and  SSS  scores  of  patients  considered  improved  and  not  improved 
reveals  that  the  improved  groups  scores  are  numerically  higher  and  at  least  no  different. 
One  study  reported  that  only  13%  of  conservatively  treated  CTS  patients  were  completely 
resolved  at  one  year  follow-up.15  If  these  results  were  generalized  to  our  patients,  few  if 
any  patients  would  be  improved  given  the  short-term  follow-up  in  our  study  and  lowering 
the  criteria  for  improved  status  was  not  valid.  A  second  reason  could  be  that  this  group 
of  patients  actually  consider  themselves  to  be  improved  despite  continued  symptoms  and 
impairment.  It  is  well  known  that  in  many  instances  symptoms,  impairment,  and 
functional  limitations  are  not  well  correlated.131,135  It  is  unknown  why  the  GRCS  did  not 
perform  similarly  when  used  to  determine  patients  with  a  worsened  status.  It  is  possible 
that  the  scale  has  different  measurement  properties  for  opposite  ends  of  the  scale.288 
Based  on  the  information  available,  the  patients  classified  as  “improved”  should  be 
considered  to  be  unchanged  at  best  and  still  experiencing  a  considerable  proportion  of 
their  original  symptoms.  The  following  predictive  validity  of  this  group  will  now  be 
discussed  from  this  perspective. 

Positively  identifying  patients  who  improved  would  serve  to  prevent  further  unnecessary 
testing  and  treatment.  However,  in  this  group  of  patients  with  persistent  symptoms,  a  test 
with  good  screening  properties  would  be  desirable  for  the  same  reasons  given  for 
detecting  patients  whose  status  will  worsen.  Thirty-one  clinical  examination  tests  items 
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was  evaluated  in  addition  to  the  following  five  additional  predictor  variables:  Age, 

FABQ  A,  median  motor  and  sensory  amplitudes,  and  spontaneous  activity  in  the  abductor 
pollicus  brevis. 

Twenty-six  of  the  test  items  met  the  criteria  for  acceptability  and  18  of  these  had 
acceptable  Likelhood  ratios.  Acceptable  Likelihood  ratios  were  representative  of  the 
following  clinical  examination  categories:  history^  5  (numbers  2-“Where  most 
bothersome.”  (i  (neck)  and  ii  (shoulder/shoulderblade)),  4-‘Tland  fat/swollen..”,  6- 
“Entire  limb  numb..”,  and  10-“Hand  shaking  improves..”),  conventional  neurologic 
examination  test  items=2  (median  sensory  fields  1  and  2),  provocative  tests=  4  (Phalen’s 
test,  CCT,  Tinel’s  A,  and  ULTT  B),  scaled  measurement=  1  (wrist  ratio  =  <  70),  and  self- 
report  instruments=  4  (FSS  >1.7,  SSS  >3.0,  VAS  >3.4,  and  FABQ  >54%),  EMG/NCS 
variables^  2  (median  palmar  amplitude  >4.3uv  and  median  motor  amplitude  >81  lOuv). 
Age  of  >40  also  had  an  acceptable  LR-  value. 

The  SSS  and  the  FSS  instruments  both  had  low  LR-values  for  this  group  as  well  as  the 
group  of  CTS  patients  whose  status  worsened.  Other  variables  with  low  LR-  values 
common  to  both  groups  include  the  Phalen’s  test  and  the  ULTT  B.  The  variable  with  the 
most  useful  LR-  value  not  common  to  both  groups  was  the  FABQ  A  score.  An  FABQ  A 
score  of  less  than  54%  drops  the  pretest  probability  of  being  in  this  group  to  a  post-test 
probability  of  3%.  Question  6  was  another  test  items  thought  to  represent  a 
psychological  construct  that  had  a  similarly  low  LR-  value.  The  reduction  of  relatively 
high  levels  of  initial  psychologic  distress  may  account  for  why  patients  perceive 
themselves  to  be  improved  despite  persistant  symptoms  and  functional  limitations. 

The  many  similar  test  items  with  high  predictive  values  for  this  group  and  the  CTS 
worsened  group  are  consistent  with  the  conclusion  that  the  “improved”  group  really 
represents  patients  who  experience  a  reducition  in  their  initial  high  level  of  psychologic 
distress  but  have  little  change  in  symptoms  and  function.  Based  on  the  test  items,  it 
appears  that  predictors  for  subjects  who  consider  themselves  improved  are  characterized 
by  symptom  predominance  (SSS  and  VASnow)  of  an  axial  location  (question  2-“Where 
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most  bothersome..”  (i  (neck)  and  ii  (shoulder/shoulder  blade)),  irritability  (provocative 
testing),  and  psychologic  distress  (FABQ  A,  Question  6-“Entire  limb  numb..”)  but 
limited  axonal  loss.  Indeed,  testing  “normal”  for  median  motor  and  sensory  amplitudes 
both  had  LR+  and  LR-  values  that  were  acceptable.  The  wrist  ratio  indicates  that 
anatomical  predisposition  may  be  useful  predictive  factor. 

7.3.3  Predictive  Validity  Summary 

The  preliminary  findings  for  predictors  of  surgical  intervention  and  worsened  CTS 
subjects  both  had  face  validity  and  may  be  helpful  for  future  research.  Interpretation  of 
the  subjects  predicted  to  be  improved  was  more  difficult;  most  likely  these  subjects  were 
unchanged  and  still  symptomatic.  Predictor  test  items  unique  to  this  group  when 
compared  with  the  worsened  group  are  items  that  relate  to  psychological  constructs  and 
axonal  injury  markers;  the  reason  for  the  former  observation  is  unclear.  Test  items  related 
to  more  severe  functional  limitation  seemed  to  be  more  predominating  for  CTS  worsened 
subjects.  For  all  three  outcome  gold  standards,  the  wrist  ratio  appears  to  be  a  useful 
predictor  variable.  A  narrow  carpal  canal  (ratio  >.70)  is  predictive  of  subjects  with  a 
worsened  status  or  subjects  treated  surgically.  An  acceptably  wide  carpal  canal  (ratio 
<.70)  is  predictive  of  subjects  in  the  “improved”  or  unchanged  group.  Further  work  is 
needed  to  clarify  this  relationship  and  determine  more  precise  diagnostic  accuracy 
characteristics. 

7.4  Hypothesis  #4  -  Test  Item  Cluster 

How  does  one  choose  to  incorporate  the  numerous  results  obtained  from  the  clinical 
examination?  Just  a  cursory  glance  at  the  tables  listing  the  diagnostic  accuracy  values  for 
tests  related  to  CR  or  CTS  gives  some  indication  that  making  an  intelligent  decision 
about  which  test  results  to  use  can  be  a  daunting  task.  Knowingly  or  unknowingly,  this 
decision  is  made  every  time  a  clinician  treats  a  patient  with  CR  or  CTS.  Some  test  results 
may  be  highly  relevant  while  others  will  have  very  little  or  no  utility.  An  additional 
problem  is  that  many  of  these  tests  are  likely  to  be  conditionally  dependant;  that  is  the 
probability  of  the  outcome  of  one  test  is  affected  by  the  outcome  of  others.23  The  use  of 
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multiple,  conditionally  dependent  tests  can  result  in  inaccurate  diagnosis  and  lead  to 
further  inappropriate  testing  and  treatment. 

Often,  attempts  to  combine  individual  test  items  into  a  single,  powerful  test  item  cluster 
appear  to  have  been  a  mere  afterthought  in  an  effort  to  improve  the  mediocre  predictive 
power  of  single  item  test  results.21  Other  investigators  have  performed  fairly 
sophisticated  multivariate  analysis  for  the  purpose  of  identifying  an  optimum  predictive 
model,  but  only  a  few  common  examination  test  items  and  self-report  instrument  were 
used  and  the  results  were  reported  as  Odds  Ratios. 16  Presently,  no  systematic  attempt  has 
been  made  to  identify  which  of  the  many  clinical  examination  variables  are  most  useful 
for  the  diagnosis  of  CR  and  CTS  and  quantify  their  utility;  this  was  the  purpose  of  our 
study.  All  variables  thought  to  be  potentially  useful  for  the  diagnosis  of  the  two 
conditions  were  considered  in  our  study,  including  data  typically  thought  to  be  “soft”.289 

7.4.1  Variable  Selection  Methodolev 

Following  a  preliminary  variable  reduction  procedure  based  on  diagnostic  utility,  a 
logistic  regression  procedure  was  used  for  the  purpose  identifying  a  parsimonious  single 
TIC  for  the  conditions  of  interest.23  This  approach  minimizes  bias,  minimizes  conditional 
dependence,23  and  identifies  variables  that  may  have  marginal  individual  predictive 
power  but  make  unique  contributions  to  a  predictive  model.  The  TIC  identified  by  the 
model  was  then  used  to  compute  familiar  and  useful  indexes  of  diagnostic  accuracy. 

7. 4. 2  Diagnostic  Accuracy  TIC’s 

The  CR  TIC  consisted  of  question  9-tcNeck  movement  improves..”,  Valsalva,  biceps 
brachii  MSR,  and  Distraction.  The  CTS  TIC  consisted  of  questions  10-“Hand  shaking 
improves.”  and  3-“Symptom  behavior..”,  the  SSS,  and  MMT  of  the  abductor  pollicus 
brevis.  As  expected,  the  CR  and  CTS  TIC’s  yielded  LR+  values  that  were  much  greater 
than  any  of  the  individual  test  items  alone;  over  four  times  larger  for  the  former  condition 
and  over  twice  as  large  for  the  later.  In  this  sample  with  a  CR  prevalence  of  1 6%,  if  the 
single  best  test  item  is  used  to  diagnose  CR  (question  7-“Symptoms  keep  from  sleep..”, 
LR+=  6.5),  a  positive  response  results  in  a  post-test  probability  of  only  55%  and  the 
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diagnosis  of  CR  is  now  just  a  little  better  than  chance.  However,  a  positive  response  to 
the  CR  TIC  using  a  criterion  of  any  three  abnormal  items  (LR+=  30.5),  we  obtain  a  post¬ 
test  probability  of  85%  and  the  confidence  of  a  definitive  diagnosis  has  greatly  increased. 
Using  a  similar  example,  if  the  single  best  test  item  is  used  to  diagnose  CTS  (FSS  >2.5, 
LR+=  3. 1),  a  positive  response  results  in  a  post-test  probability  of  66%.  Sixty-six  percent 
is  better  than  the  pretest  probability  in  this  sample  of  38%,  but  still  not  sufficient  to 
establish  a  diagnosis.  If  a  positive  response  to  the  CTS  TIC  using  a  criterion  of  any  three 
abnormal  items  is  used  (LR+=  7.1),  we  have  now  increased  our  post-test  probability  to 
81%. 


When  a  criterion  of  any  single  abnormal  finding  was  used,  the  CTS  TIC  resulted  in  a 
very  low  LR-  value  (.14),  which  is  useful  for  screening  purposes.  This  value  is  over  2.5 
times  smaller  than  the  LR-  value  of  any  single  test.  This  is  extremely  helpful  because 
only  two  individual  CTS  test  items  had  LR-  values  that  met  the  criteria  of  acceptability. 

A  LR-  value  of  this  magnitude  is  powerful  and  results  in  a  post-test  probability  of  8% 
when  a  criterion  of  any  one  finding  is  taken  as  abnormal  CTS  TIC  and  the  CTS  TIC  was 
negative.  A  criterion  of  any  single  abnormal  finding  for  the  CR  TIC  resulted  in  an  LR-  of 
.27,  which  is  not  as  powerful  as  the  LR-  associated  with  the  ULTT  A  alone  (LR-=  .15). 

The  CR  TIC  findings  raise  a  couple  of  important  observations.  The  first  is  that  neither  of 
the  test  items  with  the  single  best  LR+  and  LR-  values  is  included  in  the  TIC.  This  is 
because  the  logistic  regression  model  attempts  to  maximize  correct  classification  using  a 
predefined  probability  level  (usually  .50),  without  regard  for  direction  of  error  (i.e.  false 
positive  or  false  negatives).  The  inclusion  of  apparently  marginal  or  less  powerful 
variables  may  occur  in  order  to  maximize  correct  classification. 272p  117  This  leads  to  the 
second  observation:  the  diagnostic  characteristics  of  an  individual  test  item  may  be 
sufficiently  powerful  by  itself  for  diagnosis  or  screening.  This  is  unusual  but  can 
occur277p  128  The  ULTT  A  had  an  LR-  value  of .  15  in  this  study.  The  post-test 
probability  that  results  when  the  test  is  negative  is  3%  which  many  would  consider 
sufficient  for  screening  purposes  given  the  natural  history  of  CR.  When  used  for 
screening  purposes,  a  negative  CR  TIC  would  result  in  a  posttest  probability  of  5%. 
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The  individual  test  items  of  the  CR  TIC  were  no  surprise,  and  are  consistent  with  clinical 
experience  and  what  has  been  proposed  to  be  useful  for  the  diagnosis  of  CR.34  This  not 
the  case  for  the  CTS  TIC.  With  the  exception  of  the  abductor  pollicus  brevis  MMT, 
many  test  items  thought  to  be  predictive  of  CTS  were  not  included  in  the  TIC.  Items 
absent  were  not  only  the  traditional  provocative  tests  (Phalnen’s,  CCT,  Tinel’s  sign),  but 
the  single  item  question  related  to  waking  night  pain.  This  lack  of  utility  is  not  explained 
by  spectrum  bias,  as  none  of  these  tests  were  helpful  in  the  subclassified  groups  of  CTS 
patients.  Given  the  wide  range  of  diagnostic  characteristic  reported  for  these  test  items 
and  poor  methodology  discussed  previously,  this  is  really  no  surprise.  Other 
investigators  have  also  found  waking  night  pain  to  have  no  utility  for  the  diagnosis  of 
CTS. 16  The  wrist  ratio  was  not  helpful  for  diagnosis  when  considered  by  itself  but  was 
useful  when  considered  in  conjunction  with  other  test  items.  This  test  item  is  of 
particular  interest  because  it  was  ubiquitous  throughout  all  the  CTS  TIC’s. 

7.4.3  Predictive  Validity  TIC’s 

As  for  the  single  item  predictive  validity  analyses,  the  predictive  TIC’s  in  this  study  must 
be  considered  preliminary  due  to  the  extremely  small  sample  size  and  prevalence  of  the 
condition  of  interest.  Even  so,  the  surgery  and  worsened  TIC’s  have  face  validity  that 
lends  credibility  to  the  results. 290pp  1  ~5, 141 ' 166  For  example,  subjects  whose  predominate 
complaint  is  loss  of  feeling  located  in  the  hand  and/or  fingers  and  have  a  weak  abductor 
pollicus  brevis  are  usually  considered  likely  surgical  candidates.149  An  excessively  small 
wrist  diameter  (>.73)  was  also  identified  as  predictive.  Anatomical  predisposition  to 
median  nerve  compromise  within  the  carpal  tunnel  could  certainly  contribute  to  persistent 
signs  and  symptoms  that  are  unlikely  to  resolve  spontaneously  and  that  are  unresponsive 
to  conservative  intervention.  A  criterion  of  three  positive  items  increases  the  probability 
of  surgery  from  20%  to  81%  during  the  short-term  follow-up  period  of  this  study.  When 
the  criterion  of  two  positive  items  is  used  and  the  TIC  is  negative,  the  chance  of  surgery 
is  reduced  to  only  2%.  The  diagnostic  properties  of  this  TIC  can  be  influenced  by  many 
facets  and  could  change  when  used  with  different  population  and  different  surgeons. 
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The  three  items  of  the  CTS  worsened  TIC  also  have  face  validity  and  suggests  a  physical 
basis  for  a  worsened  status  in  these  subjects.  Pain  that  is  most  bothersome  in  the  arm  and 
forearm  of  CTS  patients  is  not  uncommon  and  is  considered  a  sign  of  increased 
irritation.168  Question  5-‘Tumbling/dropping..”  indicates  patients  have  a  significant 
degree  of  functional  limitation  and  possible  loss  of  sensory/motor  function.  For  the 
worsened  TIC,  there  appears  to  be  an  anatomical  predisposition  for  subjects  who  progress 
to  a  worsened  status  (wrist  ratio  >.70).  Using  a  criterion  of  two  items  for  a  positive  test, 
the  probability  of  a  non-surgical  subject  becoming  worsened  in  this  sample  changes  from 
35%  to  4%  when  the  TIC  is  negative.  Based  on  this  TIC,  subjects  without  a  2-item 
combination  of  either  arm/forearm  pain,  complaints  of  clumsy  hands  or  dropping  things, 
or  an  abnormal  wrist  ratio  are  very  unlikely  to  progress  to  a  worsened  status  over  the 
short-term  follow-up  period  in  this  study.  If  the  criterion  for  positive  is  all  three  items, 
then  a  patient  with  a  positive  test  has  an  83%  chance  of  becoming  worsened.  Patients 
considered  to  be  at  risk  for  worsening  may  merit  closer  monitoring  and  the  decision  for 
earlier  aggressive  therapy  might  be  considered. 

Because  subjects  in  the  “improved”  group  were  considered  to  be  unchanged  subjects  with 
persistent  symptoms,  no  pre  and  post-test  probability  examples  are  given.  The  inclusion 
of  the  FAB  Q  A  may  reflect  some  degree  of  psychological  distress  in  these  patients  which 
was  expected  to  be  present  in  patients  with  a  worsened  status.  However,  this  relationship 
could  change  with  a  longer  follow-up  period.  Question  4-“Hand  fat/swollen.  ”  represents 
sensory  disturbance  as  being  predictive  as  do  self-report  scales  of  symptoms  and 
function.  Once  again,  the  wrist  ratio  (<70)  was  a  useful  predictive  test  in  this  group  and 
coincides  with  the  abnormal  wrist  ratio  values  that  were  predictive  of  the  surgical  and 
worsened  subjects. 

7,4.4  TIC  Summary 

In  this  study,  an  optimum  cluster  of  test  items  had  better  diagnostic  power  than  any  single 
test  item  alone  for  the  CR  and  CTS  conditions.  The  exception  was  the  ULTT  A  when 
used  for  screening  purposes.  Our  method  of  adjustment  for  zero-cell  findings  in 
conjunction  with  our  sample  size  resulted  in  conservative  Likelihood  ratio  estimates  for 
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the  CR  TIC  and  the  predictive  validity  TIC’s.  Depending  on  the  criterion  used  to 
determine  a  positive  test,  these  TIC’s  resulted  in  perfect  (1.0)  Sn,  Sp,  or  both.  The 
sample  size  of  this  study  produced  wide  95CI’s  and  resulted  in  only  two  definitive  LR 
findings.  Further  testing  with  a  larger  sample  size  is  required  to  know  if  the  individual 
test  items  selected  for  the  diagnostic  TIC’s  are  most  predictive  and  to  increase  the 
precision  of  Likelihood  ratio  point  estimates.  The  predictive  validity  TIC’s  in  this  study 
can  provide  a  basis  for  the  identification  of  tests  and  TIC’s  that  are  useful  for  the 
prognosis  of  treatment  and  outcome  in  patients  with  CTS. 

7.5  Summary 

This  is  the  first  study  to  concomitantly  compare  the  diagnostic  characteristics  of  clinical 
examination  items  and  self-report  instruments  for  the  diagnosis  of  CR  and  CTS,  two 
conditions  that  may  produce  similar  signs  and  symptoms.  These  two  conditions  can 
cause  differential  diagnostic  dilemmas. 

This  study  has  several  strengths.  Including  subjects  with  one  condition  in  the  control 
group  of  the  other  effectively  challenged  the  diagnostic  characteristics  of  the  tests  that 
were  considered.  Bias  was  controlled  for  in  several  ways.  First,  a  common,  independent 
gold  standard  representing  a  construct  of  interest  (neural  impairment)  was  applied 
uniformly  to  every  patient  in  the  study  to  prevent  work-up  and  inclusion  bias.  Second, 
the  EMG/NCS  provider  and  the  physical  therapy  raters  were  blinded  to  each  other’s 
results  to  prevent  test  review  and  diagnostic  bias.  Third,  subclassification  of  CTS 
subjects  permitted  an  assessment  of  spectrum  bias  its  effect  on  the  diagnostic 
characteristics  of  test  used  for  the  diagnosis  of  CTS.  The  methodological  rigor  of  this 
study  increases  the  internal  validity  of  our  results.  The  number  of  different  raters  that 
contributed  data  from  which  diagnostic  indices  were  calculated  increases  the 
generalizabilty  of  our  study’s  findings.  Assessing  the  reliability  of  tests  used  in  our  study 
permitted  a  closer  examination  of  tests  that  performed  poorly  and  did  not  exclude  test 
with  Poor  or  Fair  reliability  from  consideration.  All  clinical  examination  procedures  in 
this  study  were  operationally  defined 


187 


There  are  also  several  limitations  of  our  study  that  must  be  acknowledged.  The  first  is 
small  number  of  subjects  with  CR  that  were  included  in  the  study.  A  larger  number  of 
CR  subjects  would  most  likely  increase  the  variability  of  findings  and  could  diminish  the 
diagnostic  characteristics  we  found  for  the  test  items  in  this  study.  Based  on  the  severity 
of  EMG/NCS  findings,  most  CR  subjects  were  considered  to  be  mild.  However,  severe 
cases  usually  increase,  not  decrease  the  diagnostic  characteristic  of  a  test.50  In  addition, 
our  findings  were  only  for  subjects  with  involvement  of  the  C6-C7  nerve  root.  Second, 
the  sample  size  in  this  study  did  not  permit  precise  estimates  for  most  diagnostic  indices. 
Although  several  Sn  and  Sp  values  were  considered  definitively  acceptable,  only  one 
individual  CR  test  item  and  two  TIC’s  (CR  and  surgery)  had  definitive  LR+  values. 

None  of  these  LR+  values  would  produce  large  changes  in  post -test  probabilities  based 
on  the  lower  limit  of  the  95CI.  Third,  the  use  of  an  EMG/NCS  gold  standard  may  have 
prevented  identification  of  subjects  who  had  CR  and  CTS.  No  gold  standard  is  perfect40 
and  we  chose  to  maximize  specificity  since  diagnosis  and  not  screening  was  the  primary 
purpose  of  our  clinical  examination  test  items.  This  was  reflected  in  the  ratio  of  LR+  to 
LR-  findings  that  were  considered  acceptable.  Use  of  a  different  gold  standard  may 
result  in  different  diagnostic  characteristics  for  the  test  items  in  this  study.  Fourth,  our 
outcome  gold  standard  of  surgery  and  change  may  not  be  useful  or  the  best  gold 
standards  for  assessing  intervention  and  change,  respectively.  It  is  well  known  health 
care  practices  may  vary  by  geographic  region291  and  our  results  may  not  be  generalizable 
in  other  settings.  The  use  of  a  GRCS  for  the  assessment  of  patient  status  has  been 
criticized  as  being  biased.147  However,  few  if  any  better  alternatives  have  been 
determined.  Finally,  all  predictive  analyses  were  short-term.  The  predictive  properties  of 
the  test  items  in  this  study  may  be  different  for  a  longer  period  of  time. 

This  study  supports  the  contentions31  and  findings28'292  of  others  that  information 
contained  the  patient  history  is  often  the  most  powerful  diagnostic  tool  available  and 
often  supplies  the  clinician  with  everything  needed  to  clinch  a  diagnosis.  The  clinical 
examination  test  items  with  the  most  powerful  LR  value  for  subjects  who  underwent 
surgery  and  worsened  were  questions.  Often,  multiple  questions  had  highly  predictive 
LR  values.  Indeed,  based  on  the  reliability  and  diagnostic  accuracy  coefficients,  the 
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“hardest”  data  in  this  study  were  those  usually  thought  to  be  “soft”.239  This  study  also 
supports  the  observation  that  when  multiple,  appropriate  test  items  are  combined  into  a 
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single  TIC,  diagnostic  power  is  often  greatly  improved.  p 

The  vast  majority  of  clinical  examination  items  in  this  study  was  found  to  have 
acceptable  reliability  and  is  most  likely  due  to  operationally  defining  our  tests  and 
standardizing  the  examination  protocol.  Two  clinical  examination  items  that  are  of 
particular  interest  are  question  7-“Sympotms  keep  from  sleep..”  and  the  ULTT  A  for  the 
diagnosis  of  CR.  Both  LR  values  were  acceptable  for  Question  7  and  its  LR+=  6.5  was 
the  only  single  test  item  value  in  this  study  that  was  definitive.  The  ULTT  was  perfectly 
sensitive  (950—  .87  —  1.0)  and  had  an  LR-=  .15.  By  comparison,  the  computed  LR- 
value  of  the  straight-leg  raise,  a  commonly  used  screening  test  for  lumbar  radiculopathy, 
is  only  .33.  Further  investigation  may  demonstrate  that  this  CR  TIC,  a  simple  question, 
or  a  single  examination  procedure  can  be  very  powerful  and  inexpensive  tools  useful  for 
the  diagnosis  and  screening  of  patients  with  CR. 

The  majority  of  individual  test  items  and  TIC  for  the  CTS  subjects  were  less  powerful 
than  were  the  tests  and  TIC  for  the  diagnosis  of  CR.  Spectrum  bias  did  appear  to  have  an 
impact  of  the  diagnostic  characteristics  of  the  tests  for  the  CTS  group  as  a  whole. 
Interestingly,  the  two  test  items  with  the  best  diagnostic  power  for  CTS  were  the 
HSAM’s.  These  two  scales  were  developed  for  evaluative  purposes,  which  usually 
diminishes  the  discriminative  properties  of  the  scales.142  This  was  not  the  case  in  our 
study. 

Unfortunately,  the  predictive  validity  of  clinical  examination  items  for  CR  were  unable  to 
be  assessed  in  this  study.  The  predictive  validity  of  clinical  examination  items  for  CTS 
will  need  to  be  tested  in  different  settings  and  over  a  longer  period  of  time.  Longer 
follow-up  times  are  required  to  determine  which  subjects  with  CTS  will  improve  as  the 
natural  course  of  this  condition  may  prolonged  and  in  many  cases  unfavorable.15 
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A  replication  study,  in  part  or  whole,  with  a  larger  sample  is  required  to  improve  the 
precision  of  point  estimates  obtained  for  the  clinical  examination  test  items  in  this  study. 
The  TIC’s  developed  in  this  study  must  be  examined  in  an  independent  sample  before 
their  utility  can  be  adequately  assessed.  Clinimetrics  involves  the  quantifying  of  data  that 
are  observed,  judged,  and  decided  on  during  the  clinical  examination  by  clinicians 
themselves290pp  1  ‘ 5  This  work  should  be  regarded  as  an  attempt  toward  developing 
Clinimetrics  for  two  common  musculoskeletal  disorders  regularly  encountered  in  clinical 
practice.  The  tests  items  and  TIC’s  developed  and  assessed  in  this  study  are  sensible, 
have  face  validity,  include  a  formal  expression  of  the  index  of  interest,  and  are  certainly 
easy  to  use.290pp  1  ’ 5' 141  * 166  Further  work  is  needed  to  establish  and  validate  the  content 
of  the  clinical  examination  and  TIC’s  used  in  this  study. 

7.6  Conclusion 

Of  the  54  clinical  examination  tests  in  this  study,  the  majority  were  found  to  have  Fair  to 
Good  or  Good  reliability  and  a  few  had  Excellent  reliability;  the  reliability  of  1 1  of  these 
items  was  poor  to  moderate.  Two  CR  test  items  had  a  definitively  acceptable  level  of  Sn 
while  no  CTS  test  items  were  definitively  acceptable.  A  number  of  CR  clinical 
examination  items  had  a  definitively  acceptable  level  of  Sp  and  three  CTS  test  items  were 
found  to  have  a  definitively  acceptable  level  of  Sp.  Likelihood  ratio  point  estimates 
indicated  that  several  test  items  and  TIC’s  are  potentially  useful  for  the  diagnosis  and 
prognosis  of  CR  and  CTS.  Question  number  7  was  found  to  have  a  definitively 
acceptable  LR+,  as  were  the  CR  TIC  and  the  surgery  TIC.  None  of  the  definitive  LR+- 
values  had  lower  bounds  that  would  result  in  post-test  probabilitie  changes  larger  than 
33%. 
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Cervical  Radiculopathy  and  CTS:  A  Prospective 
Study  of  Test  Reliability,  Diagnostic  Accuracy  and 

Predictive  Velidity^ _ Referral  Sources: 

1°  Orthopedics 

/  V  ^  \  Neurosurgery 

Hand  clinic/surgery' 

2°  FPC,  other 


*EMG  lab  Technician. 
Resident,  Staff  informs 
patient  of  study,  invites 
patient  to  participate 


Pt.  referred  for 
radiculopahy  8d or  CTS 
that  meet  entry  criteria. 


Enter  study? 


EMG/NCS 


+  Dx  Radio.  Or  CTS? 
(Hx  &  EMG/NCS) 


-i— - - —  ■■■■ - 

PT  Hx  &  UQ/Neck  Eval  Proc. 

(Rater  1  &2,  Blinded  to  EMG  results  &  Dx) 


Patiant  completes  NDL 

HSFS.  VAS.  FABQ. 

&  Hand  Diagram 


Prescribed 

CTS  Rx 


DATA  ANALYSIS 

Initial:  Reliability  &  Dx  accuracy 
-Clinical  Examination 
-Self-report  Instruments  (SRI) 
(single  &  in  clusters) 

Metric:  Sn/Sp  &  Likelihood  Ratios 
Cold  standard:  Neural  Impairment 
based  on  positive  EMG/NCS  study 

F/U:6  wks  &  6  mo:  Predictive  Validity 
_J\  -Clinical  Examination  / 


Prescribed 

RADIC.  Rx 

Surg  Non-Surg 


F/U 
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Multi-center  Study  of  Cervical  Radiculopathy 
and  Carpal  Tunnel  Syndrome  (CTS) 

Will  you  consider  being  a  participant  in  our  study?  The  physical  therapy  clinic  and  EMG 
lab  are  trying  to  determine  the  usefulness  of  commonly  used  clinical  examination 
procedures  and  questionnaires  that  are  used  to  diagnose  patients  with  your  suspected 
condition  (either  cervical  radiculopathy  and/or  CTS).  To  compensate  you  for  your  time, 

you  will  be  paid  up  to  $25.00  for  returning  data  collection  forms. 


Specifically,  we  want  to  determine  the  reliability  and  accuracy  of  the  procedures  and 
questionnaires  and  find  out  whether  they  can  help  us  predict  how  well  you  will  respond 
to  treatment.  Often  times  more  expensive  and  invasive  tests  (such  as  EMG)  can  be 
avoided  if  clinical  tests  for  detecting  the  condition  have  been  shown  to  be  reliable  and 
accurate. 

If  you  choose  to  help  us,  you  will  fill  out  some  questionnaires  and  a  standardized  EMG 
examination  of  your  affected  limb  will  be  performed.  Having  a  "standardized"  EMG 
examination  means  two  extra  nerve  conduction  procedures  will  be  performed  and 
possibly  1-4  extra  muscles  will  be  examined  with  an  EMG  pin  electrode.  We  say 
"possibly"  because  the  muscles  to  be  tested  during  the  standardized  examination  are 
often  tested  anyway  when  performing  an  EMG  examination  on  patients  with  your 
suspected  condition. 

Following  your  EMG  examination,  a  physical  therapist  will  ask  you  some  questions 
about  your  condition,  take  some  measurements  of  your  neck  and  wrists,  and  perform 
several  tests  designed  to  alter  your  symptoms.  After  a  brief  rest  period,  a  second 
physical  therapist  will  repeat  the  same  examination.  Finally,  at  six  weeks  and  six 
months  from  now  you  will  be  mailed  four  of  the  same  questionnaires  you  filled  out  today 
and  a  scale  asking  you  rate  your  improvement  along  with  a  self-addressed  stamped 
return  envelope.  Once  the  we  receive  the  forms,  paperwork  will  be  processed  to  issue 
you  a  check  in  your  name  which  you  will  receive  in  the  mail:  $15.00  for  the  6  week 
forms  and  $10.00  for  the  6  month  forms.  You  will  incur  absolutely  no  additional  cost  if 
you  choose  to  participate  in  this  study. 

Well,  in  a  nut-shell  that’s  what  your  participation  in  this  study  will  involve.  The 
technicians,  therapists,  or  doctors  involved  in  your  care  are  happy  to  answer  any 
questions  you  may  have.  And,  of  course,  a  more  detailed  description  of  the  study  is 
contained  in  the  patient  consent  form  that  you  must  read  and  sign  before  you 
participate. 

Thank  you  very  much  for  considering  participating  in  this  important  study.  Your 
assistance  will  help  us  answer  our  questions.  The  answers  we  obtain  could  help  future 
sufferers  of  cervical  radiculopathy  and  CTS  by  providing  a  way  for  their  condition  to  be 
detected  more  quickly  and  comfortably  and  predicting  how  they  may  respond  to  certain 
kinds  of  treatment. 
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Screening  Questions  for  Study  Participation: 

Name__ _ Age 

1.  Referred  for:  Cervical  radiculopathy  □  CTS  □  Side  affected:  LD  RD,  DomD  NonDomD 

2.  When  did  this  current  episode  of  symptoms  first  begin? _ 

3.  Is  this  the  first  time  you  have  experienced  the  symptoms  of  this  conditon?  Yes  □  No  □ 

4.  If  no,  how  many  previous  episodes  of  this  condition  have  you  experienced 
in  the  past? 

Only  one  other  time  D  Two  to  Three  times  □  Four  or  more  times  D 

5.  Have  you  ever  had  an  EMG  examination  of  your  affected  limb  in  the  past 

for  this  same  condition  ?  YesD  No  □ 

6.  Have  you  received  treatment  or  are  you  currently  being  treated  for 

this  condition?  YesD  No  □ 

(check  all  that  apply):  cervical  traction  □  exercise  □  wrist  splint  □  injections  □  other  □ 

7.  Do  you  take  medications  for  your  upper  limb  symptoms  or  any  other  condition?  YesD  NoD 

(If  yes,  list  medications: _  \ 


8.  Have  you  ever  had  6  or  more  drinks  during  the  course  of  your  workday? 

Carefully  read  the  following  list  and  check  the  box  that  applies  to  you: 

Do  you  have  any  of  the  following  conditions? 


Diabetes 

Yes 

□ 

No 

□ 

Thyroid  disease 

□ 

□ 

Cancer 

□ 

□ 

Kidney  disease 

□ 

□ 

Rheumatoid  arthritis 

□ 

□ 

Any  history  of  stroke  or  neurological  disease 

□ 

□ 

Blood  clotting  disorder 

□ 

□ 

Exposure  to  lead,  mercury,  or  industrial  solvents 

□ 

□ 

Prior  fractures  of  your  affected  hand,  wrist  or 
neck 

your 

□ 

□ 

Prior  surgery  of  your  affected  hand,  wrist,  or  your 
Neck  □ 

□ 

Numbness  and  tingling  of  both  arms  and/or 
both  legs 

□ 

□ 

Pregnant 

□ 

□ 

Off  work  for  longer  than  6  months 
due  to  your  symptoms 

□ 

□ 
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D/M/Y 


63491  Multicenter 
Cervical  Radiculopathy  &  CTS  Study 


/  1  9 


Last  Name 


First  Name 


Address 


Address 


Home  Phone 


Subject  ID  #  Facility  *  Form  *  ~ 

I  I  I  I-!  Ml  1=P|" 

_J_J  UJ  2=BAMC 


1=Pitt 

2=BAMC 

3=WHMC 

4=AFA 

5=MGMC 

6=FT  Camp. 


Gender  O  Male  O  Female 


Is  there  Workers  O  No  O  Yes 

compensation  involved  in  your 

Is  there  litigation  pending  or  a 

settlement  involved  in  your  O  No  O  Yes 

case? 


State  Zip 


Work  Phone 


Contact  Person  or  Relative  Not  Living  with  You 


Last  Name 


First  Name 


Address 


Address 


Home  Phone 


State  Zip 


Work  Phone 


Disentry 


5532 


NDI  FORM 


Evai  Period:  O  Initial  D/M/Y 
O  6  weeks 
O  6  months 


/ 


Subject 


/ 


1 

9 

D  #  Facility  #  Form  # 


8 


i  i  iui  i  iuci 

1=Pitt  B 

2=BAMC  1 

3=v\ji5rgrc  B 

4=AFA 
5=MGMC 
6=FT  Camp. 


Please  Read:  This  Questionnaire  is  designed  to  give  the  doctor  information  as  to  how  your  neck  pain  has  affected  your  ability  to  manage  in 
Please  answer  every  section  and  mark  in  each  section  only  the  ONE  box  which  applies  to  you.  We  realize  you  may  consider  that  two  of  tl 
any  one  section  may  relate  to  you,  but  please  just  mark  the  box  which  most  closely  describes  your  problem. 


Section  1  -  Pain  Intensity 

O  I  have  no  pain  at  the  moment. 

O  The  pain  is  very  mild  at  the  moment. 

O  The  pain  is  moderate  at  the  moment. 

O  The  pain  is  fairly  severe  at  the  moment. 

O  The  pain  is  very  severe  at  the  moment. 

O  The  pain  is  the  worst  imaginable  at  the  moment. 

Section  2  -  Personal  Care 


Section  6  -  Concentration 

O  I  can  concentrate  fully  when  I  want  to  with  no  difficulty. 

O  I  can  concentrate  fully  when  I  want  to  with  slight  difficulty. 

O  I  have  a  fair  degree  of  difficulty  in  concentrating  when  I  want  to. 
O  I  have  a  lot  of  difficulty  in  concentrating  when  I  want  to. 

O  I  have  a  great  deal  of  difficulty  in  concentrating  when  I  want  to. 
O  1  cannot  concentrate  at  all. 

Section  7  -  Work 


O  I  can  look  after  myself  normally  without  causing  extra 

O  I  can  look  after  myself  normally  but  it  causes  extra  pain. 

q  It  is  painful  to  look  after  myself  and  I  am  slow  and 
careful. 

O  I  need  some  help  but  manage  most  of  my  personal  care. 

O  I  need  help  everyday  in  most  aspects  of  self  care. 

q  I  do  not  get  dressed,  I  wash  with  difficulty  and  stay  in 
bed. 


i  (Skip  if  You  have  not  attempted  liftin 

Section  3  -  Lifting  sjnce  the  onset  of  your  neck  pajn) 

O  I  can  lift  heavy  weights  without  extra  pain 


O  I  can  lift  heavy  weights  but  it  causes  extra  pain 

Pain  prevents  me  lifting  heavy  wieghts  off  the  floor,  but  I  can 
^  manage  if  they  are  conveniently  positioned,  e.g.  on  a  table 
O  Pain  prevents  me  lifting  heavy  wieghts  but  I  can  manage 
light  to  medium  wieghts  if  they  are  conveniently  positioned. 
O  I  can  only  lift  very  light  weights. 

O  I  cannot  lift  or  carry  anything  at  all. 


Section  4  -  Reading 

q  I  can  read  as  much  as  I  want  to  with  no  pain  in  my 
neck 

O  I  can  read  as  much  as  I  want  to  with  slight  pain  in  my 

O  l  can  read  as  much  as  I  want  with  moderate  pain  in  my  neck. 

q  I  can't  read  as  much  as  I  want  because  of  moderate  pain  in 
my  neck. 

O  I  can  hardly  read  at  all  because  of  severe  pain  in  my  neck 
O  I  cannot  read  at  all. 

Section  5  -  Headaches 

O  I  have  no  headaches  at  all. 

O  I  have  slight  headaches  which  come  infrequently. 

O  I  have  moderate  headaches  which  come  infrequently. 

O  I  have  moderate  headaches  which  come  frequently. 

O  I  have  severe  headaches  which  come  frequently. 

O  I  have  headaches  all  the  time. 


O  l  can  do  as  much  work  as  I  want  to. 

O  I  can  only  do  my  usual  work,  but  no  more. 

O  I  can  do  most  of  my  usual  work,  but  no  more. 

O  I  cannot  do  my  usual  work. 

O  I  can  hardly  do  any  work  at  all. 

O  I  can't  do  any  work  at  all. 

Section  8  -  Driving 

O  I  can  drive  my  car  without  any  neck  pain. 

pv  I  can  drive  my  car  as  long  as  I  want  with  slight  pain  in 
u  neck. 

O  I  can  drive  my  car  as  long  as  I  want  with  moderate  pai 
my  neck. 

O  I  can't  drive  my  car  as  long  as  I  want  because  of  mode 
pain  in  my  neck. 

O  1  can  hardly  drive  at  all  because  of  severe  pain  in  my  r 

O  I  can't  drive  my  car  at  all. 

Section  9  -  Sleeping 

O  I  have  no  trouble  sleeping 

O  My  sleep  is  slightly  disturbed(  less  than  1  hr.  sleepless) 

O  My  sleep  is  mildly  disturbed  (1-2  hrs.  sleepless) 

O  My  sleep  is  moderayely  disturbed(2-3  hrs.  sleepless) 

O  My  sleep  is  greatly  disturbed(3-5  hrs.  sleepless) 

O  My  sleep  is  completely  disturbed(5-7  hrs.  sleepless) 

Section  10  -  Recreation 

O  I  am  able  to  engage  in  all  my  recreation  activities  with  no  neck 
pain  at  all. 

O  I  am  able  to  engage  in  all  my  recreation  activities,  with  some  pain 
in  my  neck. 

O  I  am  able  to  engage  in  most,  but  not  all  of  my  usual  recreation 
activities  because  of  pain  in  my  neck. 

O  I  am  able  to  engage  in  a  few  of  my  usual  recreation  activities 
because  of  pain  in  my  neck. 

O  I  can  hardly  do  any  recreation  activities  because  of  pain  in  my 
neck 


O  I  can't  do  any  recreation  activities  at  all. 

Note:  Please  give  only  ^Answer  per  question 
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Eval  Period:  O  Initial  D/M/Y 

/ 

/ 

1 

9 

uoweeKs  Subject  ID#  Facility# 

Form  # 

O  6  months 

1 

0 

SX  Severity  Scale 


Facility  number 

1  =Pitt  198 

2=BAMC 

3=WHMC 

4=AFA 

5=MGMC 

6=FT  Camp. 


The  following  questions  refer  to  your  symptoms  for  a  typical  24  hr.  period  during  the  last  two 
weeks  (circle  one  answer  to  each  question). 


1.  How  severe  is  the  hand  or  wrist  pain  that  you  have  at  night? 
O  I  do  not  have  hand  or  wrist  pain  at  night 

O  Mild  pain 
O  Moderate  pain 
O  Severe  pain 
O  Very  severe  pain 

2.  How  often  did  hand  or  wrist  pain  wake  you  up  during  a 
typical  night  in  the  past  two  weeks? 

O  Never 

O  Once 

O  Two  to  three  times 
O  Four  or  five  times 
O  More  than  five  times 

3.  Do  you  typically  have  pain  in  your  hand  or  wrist  during 
the  daytime? 

O  I  never  have  pain  during  the  day. 

O  I  have  mild  pain  during  the  day. 

O  I  have  moderate  pain  during  the  day. 

O  I  have  severe  pain  during  the  day. 

O  I  have  very  severe  pain  during  the  day. 

4.  How  often  do  you  have  hand  or  wrist  pain  during  the 
daytime? 

O  Never 

O  Once  or  twice  a  day. 

O  Three  to  five  times  a  day. 

O  More  than  five  times  a  day. 

O  The  pain  is  constant. 

5.  How  long,  on  average,  does  an  episode  of  pain  last 
during  the  daytime? 

O  I  never  get  pain  during  the  day. 

O  Less  than  10  minutes. 

O  10  to  60  minutes. 

O  Greater  than  60  minutes. 

O  The  pain  is  constant  throughout  the  day. 


6.  Do  you  have  numbness  (loss  of  sensation)  in  your  hand? 

ONo 

O  I  have  mild  numbness 
O  I  have  moderate  numbness. 

O  I  have  severe  numbness. 

O  I  have  very  severe  numbness. 

7.  Do  you  have  weakness  in  your  hand  or  wrist  ? 

O  No  weakness 

O  Mild  weakness. 

O  Moderate  weakness. 

O  Severe  weakness. 

O  Very  severe  weakness. 

8.  Do  you  have  tingling  sensations  in  your  hand? 

O  No  tingling. 

O  Mild  tingling. 

O  Moderate  tingling 
O  Severe  tingling. 

O  Very  severe  tingling. 

9.  How  severe  is  numbness  (loss  of  sensation)  or  tingling  at  night? 
O  I  have  no  numbness  or  tingling  at  night. 

O  Mild 
O  Moderate 
O  Severe 
O  Very  Severe 

10.  How  often  did  hand  numbness  or  tingling  wake  you  up  during  a 
typical  night  during  the  past  two  weeks? 

O  Never 
O  Once 

O  Two  or  three  times 
O  Four  or  five  times 
O  More  than  five  times 

11.  Do  you  have  difficulty  with  the  grasping  and  use  of  small 
objects  such  as  keys  or  pens? 

O  No  difficulty 
O  Mild  difficulty 
O  Moderate  difficulty 
O  Severe  difficulty 
O  Very  severe  difficulty 
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HAND  DIAGRAM 
&  VAS  FORM 


Eval  Period:  O  initial  DIMrY 

/  / 

1  9 

O  6  weeks  Subject  ID  #  Facility  if  Form  # 

O  6  months 

9 

Facility  numbe 
1=Pitt200 


2=BAMC 

3=WHMC 

4=AFA 

5=MGMC 


6=FT  Camp. 


Using  the  key  shown  below,  please  draw  in  on  the  diagram  the  areas  of  each  hand/arm  where  you 
have  felt  pain,  numbness,  tingling,  or  other  types  of  discomfort  on  a  typical  day  during  the  PAST  TWO 


WEEKS. 


Key: 


XXXX . Pain 

llll  ....Numbness/tingling 

0000  ....Other  discomfort  (please  describe: 


LEFT  HAND/ARM 


RIGHT  HAND/ARM 


Please  use  the  three  scales  below  to  rate  your  pain  over  the  past  24  hours.  Use  the  upper  line  to  rate  your 
pain  level  right  now. 


Use  the  other  scales  to  rate  your  pain  at  its  worst  and  best  over  the  past  24 


Right  Now: 

NO  PAIN  _ 
Worst  past  24  hrs. 
NO  PAIN  _ 

Best  past  24  hrs. 
NO  PAIN  - 


hours. 


WORST 

POSSIBLE  PAIN 


WORST 
POSSIBLE  PAIN 

WORST 
POSSIBLE  PAIN 
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FABQ  (activity] 


Eval  Period:  O  initial  D/M/Y 
O  6  weeks 


/ 

/ 

1 

9 

O  6  months 


Subject  ID  #  Facility  #  Form  # 


1 


Facility  numbe 

3=WHMC 
4=AFA 
5=MGMC 
6=FT  Camp. 


Here  are  some  of  the  things  other  patients  have  told  us  about  their  pain.  For  each  statement 
please  mark  the  number  from  0  to  6  to  indicate  how  much  physical  activities  such  as  bending, 
lifting,  walking  or  driving  affect  or  would  affect  your  neck  and/or  hand  pain. 


Completely 

Disagree 

Unsure 

Completely 

Agree 

1.  My  pain  was  caused  by  physical 
activity 

oo 

oi 

02 

03 

04 

05 

06 

2.  Physical  activity  makes  my  pain  worse 

oo 

01 

02 

03 

0  4 

05 

06 

3.  Physical  activity  might  harm  my  neck  an 
wrist 

oo 

O  1 

02 

03 

04 

05 

06 

4.  1  should  not  do  physical  activities  whict 
(might)  make  my  pain  worse 

oo 

O  1 

02 

03 

04 

05 

06 

5.  1  cannot  do  physical  activities  which 
(might)  make  my  pain  worse 

oo 

0  1 

02 

03 

04 

05 

06 

FABQ  1/2 
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FABQ  (work) 


Eval  Period:  O  initial  Date 
O  6  weeks 
O  6  months 


/ 

/ 

1 

9 

Subject  ID  #  Facility  #  Form  # 

in 


FaciJl 
nur 
1=  Pitt 
2=  NNMC 
3=  WHMC 
4=  AFA 


The  following  statements  are  about  how  your  normal  work  affects 
or  would  affect  your  neck  and/or  hand  pain. 


Completely 

Disagree 

Unsure 

Com  pletely 
Agree 

6.  My  pain  was  caused  by  my  work  or 
accident  at  work 

O0 

01 

02 

03 

04 

05 

06 

7.  My  work  aggravated  my  pain 

oo 

01 

02 

03 

04 

05 

06 

8.  1  have  a  claim  for  compensation  for 
mv  pain 

oo 

01 

02 

03 

04 

05 

06 

9.  My  work  is  too  heavy  for  me 

oo 

O  1 

02 

03 

04 

05 

06 

10.  My  work  makes  orwould  make  my 
pain  worse 

oo 

01 

02 

03 

04 

05 

06 

1 1 .  My  work  might  harm  my  neck  and/or 
hand 

oo 

0  1 

02 

03 

04 

05 

06 

12.  1  should  not  do  my  regular  work  with 
my  present  pain 

oo 

O  1 

02 

03 

04 

05 

06 

13.  1  cannot  do  my  normal  work  with  my 
present  pain 

oo 

Ol 

02 

03 

04 

05 

06 

14.  1  cannot  do  my  normal  work  until  my 
pain  is  treated 

oo 

O  1 

02 

03 

04 

05 

06 

15.  1  do  not  think  that  1  will  be  back  to  my 
normal  work  within  3  months 

oo 

0  1 

02 

03 

04 

05 

06 

16.  Ido  not  think  that  1  will  ever  be  able 

to  go  back  to  that  work 

oo 

0  1 

0  2 

0  3 

04 

05 

06 

H  FABQ  2/2 

Eval  Period:  D/M/Y 

O  6  weeks 
O  6  months 


FOLLOW-UP  &  06mon,hs  p~ 

OUTCOME  - 

1.  Since  your  EMG/NCS  test,  have  you  HAD  surgery? 

2.  Since  your  EMG/NCS  test,  have  you  BEEN  OFFERED  surgery? 


Subject  ID  #  Facility  #  Form  # 

i  n  RTT 


O  No  O  Yes 
O  No  O  Yes 


Facility  number 

1=Pitt 

2=BAMC 

3=WHMC 

4=AFA 

5=MGMC 

6=FT  Camp. 


3.  Compared  to  your  condition  prior  to  treatment,  which  item  on  the  scale  below  best  describes  your 
present  condition  (choose  only  one): 

Patient  Global  Rating  Scale 

O  A  very  great  deal  worse 
O  A  great  deal  worse 
O  Quite  a  bit  worse 
O  Moderately  worse 
O  Somewhat  worse 
O  A  little  bit  worse 
O  A  tiny  bit  worse  (almost  the  same) 

O  About  the  same 
O  A  tiny  bit  better  (almost  the  same) 

O  A  little  bit  better 
O  Somewhat  better 
O  Moderately  better 
O  Quite  a  bit  better 
O  A  great  deal  better 
O  A  very  great  deal  better 

4.  Please  check  any  of  the  following  treatments  you  have  received  for  your  condition  (check  all  that  apply): 

Medication 

ONone  OAnti-inflammitory  O  Narcotics  O  Tylenol  O  Steroids  O  Other 

(Motrin,  Advil, 

Naproxsvn.  etc.) 

Other  Conservative  Treatments: 

O  None  O  Collar  O  Traction  O  Manipulation  O  Exercise  O  Wrist  Splint  O  Injection 

5.  Please  check  any  of  the  following  treatments  you  are  still  receiving  for  your  condition  (check  all  that  apply): 
Medication 

ONone  OAnti-inflammitory  O  Narcotics  O  Tylenol  O  Steroids  O  Other 

(Motrin,  Advil, 

Naoroxsvn.  etc.) 

Other  Conservative  Treatments: 

ONone  O  Collar  O  Traction  O  Manipulation  O  Exercise  O  Wrist  Splint  O  Injection 


Is  there  anything  else  you  would  like  to  tells  us  about  your  condition? 
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QUESTION  FORM 


D/M/Y 

/ 

/ 

1 

9 

Facility  number 

1=Pltt  205 
2=BAMC 

3=WHMC 

4=AFA 

5=MGMC 

6=FT  Camp. 

Subject  ID  #  Facility  #  Rater# 

Exam#  Form  # 

_1J  [6 

1.  Which  of  the  following  symptoms  are  most  bothersome  for  you?  (choose  one) 
O  Pain 

O  Numbness  &  Tingling 
O  Loss  of  feeling 

2.  Where  are  your  symptoms  most  bothersome? 

O  Neck 

O  Shoulder  or  shoulder  blade 
O  Arm  above  elbow 
O  Arm  below  elbow 
O  Hands  and/or  fingers 

3.  Which  of  the  following  best  describes  the  behaviour  of  your  symptoms? 

O  Constant 

O  Intermittent  (symptoms  come  &  go) 

O  Variable  (symptoms  improve  or  worsen  at  times) 


4.  Does  your  affected  hand  feel  "fat"  or  "swollen"? 

5.  Do  you  have  trouble  with  fumbling  or  dropping  objects  from  your  affected  hand? 

6.  Does  your  entire  affected  limb  and/or  hand  feel  numb? 

7.  Do  your  symptoms  keep  you  from  falling  asleep  at  night? 

8.  Do  your  symptoms  wake  you  during  the  night? 

9.  Do  your  symptoms  improve  with  moving  or  positioning  your  neck? 

10.  Do  your  symptoms  improve  with  moving,  "shaking",  or  positioning  your  wrist  or  hands? 

11.  Are  your  symptoms  brought  on  or  made  worse  when  performing  tasks  that  require  a  lot 
or  grasping  or  hand  and/or  finger  use? 


NO 

0  YES 

NO 

OYES 

NO 

OYES 

NO 

OYES 

NO 

OYES 

NO 

OYES 

NO 

OYES 

NO 

OYES 

■  “Tn 
_ ■ 

Dm"  i  /  [7 

9 

i 

Facility  number  j 

1=Pitt  206 
2=BAMC 

3=WHMC 

4=AFA 

5=MGMC 

C-CT  -  h 

|  ■ 

5140 

QUESTION  FORM 

Subject  ID  it  Facility  it  Rater # 

Exam#  Form  # 

®  @ 

6=FT  Camp.  r 

i 


1.  Which  of  the  following  symptoms  are  most  bothersome  for  you?  (choose  one) 
O  Pain 

O  Numbness  &  Tingling 
O  Loss  of  feeling 

2.  Where  are  your  symptoms  most  bothersome? 

O  Neck 

O  Shoulder  or  shoulder  blade 
O  Arm  above  elbow 
O  Arm  below  elbow 
O  Hands  and/or  fingers 

3.  Which  of  the  following  best  describes  the  behaviour  of  your  symptoms? 

O  Constant 

O  Intermittent  (symptoms  come  &  go) 

O  Variable  (symptoms  improve  or  worsen  at  times) 


4.  Does  your  affected  hand  feel  "fat"  or  "swollen"? 

5.  Do  you  have  trouble  with  fumbling  or  dropping  objects  from  your  affected  hand? 

6.  Does  your  entire  affected  limb  and/or  hand  feel  numb? 

7.  Do  your  symptoms  keep  you  from  falling  asleep  at  night? 

8.  Do  your  symptoms  wake  you  during  the  night? 

9.  Do  your  symptoms  improve  with  moving  or  positioning  your  neck? 

10.  Do  your  symptoms  improve  with  moving,  "shaking",  or  positioning  your  wrist  or  hands? 

11.  Are  your  symptoms  brought  on  or  made  worse  when  performing  tasks  that  require  a  lot 
or  grasping  or  hand  and/or  finger  use? 


O  NO  O  YES 
O  NO  O  YES 
O  NO  O  YES 
O  NO  O  YES 
O  NO  O  YES 
O  NO  O  YES 
O  NO  O  YES 
O  NO  O  YES 


EXAM  FORM 


Involved  Side 

O  Left  O  Right 


Wrist  Ratio: 


Ant-Post  (mm): 


Med-Lat  (mm): 


D/M/Y  — I - 1  !  — 1 — I  /  g  I  I  I 

• - 1 -  I - - 1  - 1—1 - 1 -  2=BAMC 

Subject  ID  #  Facility  #  Rater#  Exam#  Form  #  3=WHMC 


I]  G  0  E 


Measurements: 


L  ROT 


Provocative  Tests: 


Extension 


4=AFA 
5=MGMC 
6=FT  Camp. 


Cervical  ROM 


Flexion 


R  ROT 


Spurling's  A 

Spurling's  B 

Shoulder 

Abduction 

Valsalva 
Distraction 
Tinel's  A 
Tinel’s  B 


NOTES: 


NOT 

TOL. 

O 


ULTT  GRADING:  0=Negative; 

A:  1-6, s/s  diff.=Positive  B:  1-5, s/s  diff.=Positive  s(s  not 

diff.  T0L 

ULTT  A  OO  Ol  02  03  04  05  06  O  O 


ULTT  B 


OO  01  02  03  04  05 


Timed  Tests: 


CCT  (30  sec.) 

O  Negative 
O  Pos.  <15 
O  Pos.  <30 
O  NotTol. 


Phalen's  (60  sec.) 

O  Negative 
O  Pos.  <15 
O  Pos.  <30 
O  Pos.  <45 
O  Pos.  <60 
O  NotTol. 
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EXAM  FORM 

Involved  Side 

O  Left  O  Right 


D/M/Y 

/  ' 

/  1  9 

Subject  ID  # 

Facility  #  Rater#  Exam#  Form  # 

n  [2 

7 

Measurements: 


Facility  nu3G&r  j 
1=Pitt 
2=BAMC 
3=WHMC 
4=AFA 
5=MGMC 
6=FT  Camp, 


Cervical  ROM 


Wrist  Ratio: 

Ant-Post  (mm): 


Med-Lat  (mm): 


Flexion 


L  ROT  1 - 1 - 1 - 1  R  ROT 


Provocative  Tests: 


Spurling's  A 

Spurling's  B 

Shoulder 

Abduction 

Valsalva 

Distraction 
Tinel's  A 
Tinel’s  B 


NOTES: 


NEG 

O 

O 

o 

o 

o 

o 

o 


POS 

o 

o 

o 

o 

o 

o 

o 


NOT 

TOL. 

o 

o 


o 

o 

o 

o 


ULTT  GRADING:  0=Negative; 

A:  1-6, s/s  diff.=Positive  B:  1-5, s/s  diff.=Positive 


O  Negative 
O  Pos.  <15 
O  Pos.  <30 
O  NotTol. 


O 

O 

o 

o 

o 

o 


Negative 
Pos.  <15 
Pos.  <30 
Pos.  <45 
Pos.  <60 
Not  Tol. 


s/s 

diff. 


ULTT  A  O  0  01  02  03  04  05  06  O 

ULTT  B  OO  0 1  02  03  04  05  O 

Timed  Tests: 

CCT  (30  sec.)  Phalen's  (60  sec.) 


NOT 

TOL. 

O 

O 


4337 


Neurologic 

Exam 


f - — - - - - - - - - - - 

Facility  number 
1=Pitt 

2=BAMC 

3=WHMC 

4=AFA 

5=MGMC 

6=FT  Camp. 

D/M/Y 

/ 

/  1  9 

Subject  ID#  Facility#  Rater#  Exam#  Form  # 

2 

^1 

5 

i  [ 

1.  MSRs 


Left 


Right 


Absent 

Reduced 

WNL 

Increased 

Absenl 

Reduced 

WNL 

Increased 

Biceps 

0  0 

O  1 

02 

03 

oo 

O  1 

02 

03 

Brachioradialis 

O  0 

01 

02 

03 

oo 

O  1 

02 

03 

Triceps 

oo 

0  1 

02 

03 

oo 

01 

02 

03 

2.  Sensory  Examination 

(paperclip  point)  “* 

Left 

Right 

Absenl 

Dermatomes 

Reduced 

WNL 

Increased 

Absenl 

Reduced 

WNL 

Increased 

C5  (lateral  deltoid) 

oo 

O  1 

02 

03 

oo 

O  1 

02 

03 

C6  (rad.  aspect  of  index  f.) 

oo 

O  1 

02 

03 

oo 

O  1 

02 

03 

C7  (dorsum  middle  f.) 

oo 

O  1 

02 

03 

oo 

O  1 

02 

03 

C8  (med.  aspect  little  f.) 

0  0 

O  1 

02 

03 

oo 

O  1 

02 

03 

T1  (med.  aspect  mid-forearm) 

o  o 

O  1 

02 

03 

oo 

O  1 

02 

03 

Median  N.  Distribution 
(Palmar  surface  compared  to  thenar  skin) 

Left 

Right 

Thumb 

oo 

O  1 

02 

03 

oo 

O  1 

02 

03 

Index  Finger 

oo 

O  1 

02 

03 

oo 

O  1 

02 

03 

Middle  Finger 

0  0 

O  1 

02 

03 

oo 

O  1 

02 

03 

3.  Motor  Examination: 

Absent 

Markedly 
Reduced 
(P-to  F) 

Reduced 
(F+  to  G) 

Normal 

(N) 

Absent 

Markedly 
Reduced 
(P- to  F) 

Reduced 
(F+  to  G) 

Normal 

(N) 

Myotome 

C5 

Muscle 

deltoid 

OO 

0  1 

02 

03 

oo 

O  1 

02 

03 

C6 

biceps 

oo 

O  1 

02 

03 

oo 

O  1 

02 

03 

ext  carp  rad 

o  o 

O  1 

02 

03 

oo 

O  1 

02 

03 

C7 

triceps 

o  o 

O  1 

02 

03 

oo 

O  1 

02 

03 

flex  carp  rad 

o  o 

O  1 

02 

O  3 

oo 

O  1 

0  2 

03 

C8 

abd  poll  brev 

o  o 

O  1 

02 

O  3 

oo 

O  1 

0  2 

03 

T1 

first  dorsal  Int 

oo 

O  1 

02 

0  3 

oo 

0  1 

0  2 

03 

4979 


Neurologic 

Exam 

1.  MSRs 

Biceps 

Brachioradialis 

Triceps 


2.  Sensory  Examination:  .  u 

^paperclip  point)  LeTT  Right 


A 

Dermatomes 

bsenl 

Reduced 

WNL 

Increased 

Absent 

Reduced 

WNL 

Increased 

C5  (lateral  deltoid) 

oo 

O  1 

02 

03 

oo 

O  1 

02 

03 

C6  (rad.  aspect  of  index  f.) 

oo 

O  1 

02 

03 

OO 

O  1 

02 

03 

C7  (dorsum  middle  f.) 

oo 

O  1 

02 

03 

O  0 

01 

02 

03 

C8  (med.  aspect  little  f.) 

oo 

O  1 

02 

03 

oo 

0  1 

02 

03 

T1  (med.  aspect  mid-forearm) 

oo 

O  1  . 

02 

03 

OO 

0  1 

02 

03 

Median  N.  Distribution 

(Palmar  surface  compared  to  thenar  skin) 

Left 

Right 

Thumb 

oo 

O  1 

02 

03 

O  0 

O  1 

02 

03 

Index  Finger 

oo 

O  1 

02 

03  ; 

oo 

O  1 

02 

03 

Middle  Finger 

oo 

O  1 

02 

03  i 

oo 

0  1 

02 

03 

Markedly 

Markedly 

Absent 

Reduced 

Reduced 

Normal 

Absent 

Reduced 

Reduced 

Normal 

Motor  Examination: 

(P-  to  F) 

(F+  to  G) 

(N) 

(P- to  F) 

(F+  to  G) 

(N) 

Myotome 

Muscle 

C5 

deltoid 

oo 

O  1 

02 

O  3 

OO 

O  1 

02 

03 

C6 

biceps 

oo 

O  1 

02 

03 

oo 

O  1 

02 

03 

ext  carp  rad 

oo 

O  1 

02 

O  3 

oo 

O  1 

02 

03 

C7 

triceps 

oo 

O  1 

02 

03 

OO 

Ol 

02 

03 

flex  carp  rad 

oo 

O  1 

02 

O  3 

oo 

O  1 

O  2 

03 

C8 

abd  poll  brev 

oo 

O  1 

02 

O  3 

o  o 

O  1 

02 

O  3 

T1 

first  dorsal  Int 

oo 

O  1 

O  2 

O  3 

oo 

O  1 

02 

03 
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